Automatic temporal analysis of speech: A quick and objective pipeline for the assessment of overt stuttering.

IF 4.6 2区 心理学 Q1 PSYCHOLOGY, EXPERIMENTAL
Vishruta Yawatkar, Ho Ming Chow, Evan Usler
{"title":"Automatic temporal analysis of speech: A quick and objective pipeline for the assessment of overt stuttering.","authors":"Vishruta Yawatkar, Ho Ming Chow, Evan Usler","doi":"10.3758/s13428-025-02733-z","DOIUrl":null,"url":null,"abstract":"<p><p>Fluency disorders, such as developmental stuttering, have been characterized by behavior such as blocks, repetitions, and prolongations in speech. Accurate measurement of overt stuttering behavior can aid in diagnostic evaluation and the determination of optimal treatment for this disorder. This study proposes a method - Automatic Temporal Analysis of Speech (ATAS) - for the assessment of speech fluency based on the detection and quantification of discrete pauses and vocal events. Our ATAS metrics include speech rate, total pause time, pause count, mean pause duration, mean vocal duration, pause duration variability, and vocal duration variability. We used oral reading audio samples from a total of 35 English-speaking participants: 17 from adults who stutter (AWS) and 18 from adults who do not stutter (AWNS). AWS, in general, exhibited more pausing or hesitancy in speech compared to AWNS, as evidenced by slower speech rate, greater total pause time, higher pause count, and longer mean duration of pause events. Numerous pause and vocal metrics acquired from ATAS were correlated with a canonical measure of stuttering frequency percent syllables stuttered, suggesting that automatically detected temporal metrics of pause and vocal events within continuous speech are highly associated with overt stuttering behavior. ATAS metrics generally predicted the status of each participant as either an AWS or AWNS grouping with accuracies considerably higher than random guessing using random forest and LSTM classifiers. This pipeline may provide an alternative and complementary method that speech-language pathologists and other health professionals can use in the assessment of fluency disorders.</p>","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":"57 8","pages":"228"},"PeriodicalIF":4.6000,"publicationDate":"2025-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12274223/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Behavior Research Methods","FirstCategoryId":"102","ListUrlMain":"https://doi.org/10.3758/s13428-025-02733-z","RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHOLOGY, EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0

Abstract

Fluency disorders, such as developmental stuttering, have been characterized by behavior such as blocks, repetitions, and prolongations in speech. Accurate measurement of overt stuttering behavior can aid in diagnostic evaluation and the determination of optimal treatment for this disorder. This study proposes a method - Automatic Temporal Analysis of Speech (ATAS) - for the assessment of speech fluency based on the detection and quantification of discrete pauses and vocal events. Our ATAS metrics include speech rate, total pause time, pause count, mean pause duration, mean vocal duration, pause duration variability, and vocal duration variability. We used oral reading audio samples from a total of 35 English-speaking participants: 17 from adults who stutter (AWS) and 18 from adults who do not stutter (AWNS). AWS, in general, exhibited more pausing or hesitancy in speech compared to AWNS, as evidenced by slower speech rate, greater total pause time, higher pause count, and longer mean duration of pause events. Numerous pause and vocal metrics acquired from ATAS were correlated with a canonical measure of stuttering frequency percent syllables stuttered, suggesting that automatically detected temporal metrics of pause and vocal events within continuous speech are highly associated with overt stuttering behavior. ATAS metrics generally predicted the status of each participant as either an AWS or AWNS grouping with accuracies considerably higher than random guessing using random forest and LSTM classifiers. This pipeline may provide an alternative and complementary method that speech-language pathologists and other health professionals can use in the assessment of fluency disorders.

言语的自动时间分析:一种快速客观的评估显性口吃的方法。
流利性障碍,如发育性口吃,其特征是说话时语无伦次、重复和拖延。准确测量明显的口吃行为有助于诊断评估和确定最佳治疗方法。本研究提出了一种方法-语音自动时间分析(ATAS) -基于离散停顿和语音事件的检测和量化来评估语音流畅性。我们的ATAS指标包括语速、总暂停时间、暂停计数、平均暂停时间、平均声音持续时间、暂停持续时间可变性和声音持续时间可变性。我们使用了来自35名说英语的参与者的口语阅读音频样本:17名来自口吃的成年人(AWS), 18名来自不口吃的成年人(AWNS)。一般来说,与自动语音识别系统相比,自动语音识别系统在语音中表现出更多的停顿或犹豫,这可以从更慢的语音速率、更长的总停顿时间、更高的停顿次数和更长的平均停顿事件持续时间中得到证明。从ATAS中获得的许多停顿和声音指标与口吃频率百分比的标准测量相关,这表明在连续讲话中自动检测到的停顿和声音事件的时间指标与明显的口吃行为高度相关。ATAS指标通常预测每个参与者作为AWS或AWNS分组的状态,其准确性大大高于使用随机森林和LSTM分类器的随机猜测。这条管道可能为语言病理学家和其他健康专业人员提供了一种替代和补充的方法,可以用于评估流利性障碍。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
10.30
自引率
9.30%
发文量
266
期刊介绍: Behavior Research Methods publishes articles concerned with the methods, techniques, and instrumentation of research in experimental psychology. The journal focuses particularly on the use of computer technology in psychological research. An annual special issue is devoted to this field.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信