连接语音的自动计算上下文敏感特征提高了对阿尔茨海默病损伤的预测。

IF 2.2
Graham Flick, Rachel Ostrand
{"title":"连接语音的自动计算上下文敏感特征提高了对阿尔茨海默病损伤的预测。","authors":"Graham Flick, Rachel Ostrand","doi":"10.1044/2025_JSLHR-24-00297","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>Early detection is critical for effective management of Alzheimer's disease (AD) and other dementias. One promising approach for predicting AD status is to automatically calculate linguistic features from open-ended connected speech. Past work has focused on individual word-level features such as part of speech counts, total word production, and lexical richness, with less emphasis on measuring the relationship between words and the context in which they are produced. Here, we assessed whether linguistic features that take into account where a word was produced in the discourse context improved the ability to predict AD patients' Mini-Mental State Examination (MMSE) scores and classify AD patients from healthy control participants.</p><p><strong>Method: </strong>Seventeen linguistic features were automatically computed from transcriptions of spoken picture descriptions from individuals with probable or possible AD (<i>n</i> = 176 transcripts). This included 12 word-level features (e.g., part of speech counts) and five features capturing contextual word choices (linguistic surprisal, computed from a computational large language model, and properties of words produced following filled pauses). We examined whether (a) the full set jointly predicted MMSE scores, (b) the addition of contextual features improved prediction, and (c) linguistic features could classify AD patients (<i>n</i> = 130) versus healthy participants (<i>n</i> = 93).</p><p><strong>Results: </strong>Linguistic features accurately predicted MMSE scores in individuals with probable or possible AD and successfully identified up to 87% of AD participants versus healthy controls. Statistical models that contained linguistic surprisal (a contextual feature) performed better than those that included only word-level and demographic features. Overall, AD patients with lower MMSE scores produced more empty words, fewer nouns and definite articles, and words that were higher frequency yet more surprising given the previous context.</p><p><strong>Conclusion: </strong>These results provide novel evidence that metrics related to contextualized word choices, particularly the surprisal of an individual's words, capture variance in degree of cognitive decline in AD.</p>","PeriodicalId":520690,"journal":{"name":"Journal of speech, language, and hearing research : JSLHR","volume":" ","pages":"1-22"},"PeriodicalIF":2.2000,"publicationDate":"2025-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Automatically Calculated Context-Sensitive Features of Connected Speech Improve Prediction of Impairment in Alzheimer's Disease.\",\"authors\":\"Graham Flick, Rachel Ostrand\",\"doi\":\"10.1044/2025_JSLHR-24-00297\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Purpose: </strong>Early detection is critical for effective management of Alzheimer's disease (AD) and other dementias. One promising approach for predicting AD status is to automatically calculate linguistic features from open-ended connected speech. Past work has focused on individual word-level features such as part of speech counts, total word production, and lexical richness, with less emphasis on measuring the relationship between words and the context in which they are produced. Here, we assessed whether linguistic features that take into account where a word was produced in the discourse context improved the ability to predict AD patients' Mini-Mental State Examination (MMSE) scores and classify AD patients from healthy control participants.</p><p><strong>Method: </strong>Seventeen linguistic features were automatically computed from transcriptions of spoken picture descriptions from individuals with probable or possible AD (<i>n</i> = 176 transcripts). This included 12 word-level features (e.g., part of speech counts) and five features capturing contextual word choices (linguistic surprisal, computed from a computational large language model, and properties of words produced following filled pauses). We examined whether (a) the full set jointly predicted MMSE scores, (b) the addition of contextual features improved prediction, and (c) linguistic features could classify AD patients (<i>n</i> = 130) versus healthy participants (<i>n</i> = 93).</p><p><strong>Results: </strong>Linguistic features accurately predicted MMSE scores in individuals with probable or possible AD and successfully identified up to 87% of AD participants versus healthy controls. Statistical models that contained linguistic surprisal (a contextual feature) performed better than those that included only word-level and demographic features. Overall, AD patients with lower MMSE scores produced more empty words, fewer nouns and definite articles, and words that were higher frequency yet more surprising given the previous context.</p><p><strong>Conclusion: </strong>These results provide novel evidence that metrics related to contextualized word choices, particularly the surprisal of an individual's words, capture variance in degree of cognitive decline in AD.</p>\",\"PeriodicalId\":520690,\"journal\":{\"name\":\"Journal of speech, language, and hearing research : JSLHR\",\"volume\":\" \",\"pages\":\"1-22\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2025-10-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of speech, language, and hearing research : JSLHR\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1044/2025_JSLHR-24-00297\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of speech, language, and hearing research : JSLHR","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1044/2025_JSLHR-24-00297","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

目的:早期发现是有效管理阿尔茨海默病(AD)和其他痴呆症的关键。一种很有前途的预测AD状态的方法是自动计算开放式连接语音的语言特征。过去的研究主要集中在单个词级特征上,如词性计数、总词量和词汇丰富度,而较少强调测量词与产生词的上下文之间的关系。在这里,我们评估了考虑话语上下文中单词产生位置的语言特征是否提高了预测AD患者迷你精神状态检查(MMSE)分数和将AD患者与健康对照参与者分类的能力。方法:从可能或可能患有AD的个体的口头图片描述转录中自动计算17个语言特征(n = 176)。这包括12个单词级别的特征(例如,词性计数)和5个捕获上下文单词选择的特征(语言惊讶度,从计算大型语言模型中计算出来,以及填充停顿后产生的单词属性)。我们研究了(a)完整的集合是否联合预测MMSE评分,(b)添加上下文特征改进预测,以及(c)语言特征是否可以将AD患者(n = 130)与健康参与者(n = 93)进行分类。结果:语言特征准确地预测了可能或可能患有AD的个体的MMSE评分,并成功地识别了87%的AD参与者与健康对照组。包含语言惊喜(上下文特征)的统计模型比只包含单词水平和人口特征的统计模型表现得更好。总体而言,MMSE得分较低的AD患者会产生更多的空词,更少的名词和定冠词,以及频率更高但更令人惊讶的词。结论:这些结果提供了新的证据,表明与语境化词汇选择相关的指标,特别是个体词汇的惊讶程度,反映了AD患者认知能力下降程度的差异。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Automatically Calculated Context-Sensitive Features of Connected Speech Improve Prediction of Impairment in Alzheimer's Disease.

Purpose: Early detection is critical for effective management of Alzheimer's disease (AD) and other dementias. One promising approach for predicting AD status is to automatically calculate linguistic features from open-ended connected speech. Past work has focused on individual word-level features such as part of speech counts, total word production, and lexical richness, with less emphasis on measuring the relationship between words and the context in which they are produced. Here, we assessed whether linguistic features that take into account where a word was produced in the discourse context improved the ability to predict AD patients' Mini-Mental State Examination (MMSE) scores and classify AD patients from healthy control participants.

Method: Seventeen linguistic features were automatically computed from transcriptions of spoken picture descriptions from individuals with probable or possible AD (n = 176 transcripts). This included 12 word-level features (e.g., part of speech counts) and five features capturing contextual word choices (linguistic surprisal, computed from a computational large language model, and properties of words produced following filled pauses). We examined whether (a) the full set jointly predicted MMSE scores, (b) the addition of contextual features improved prediction, and (c) linguistic features could classify AD patients (n = 130) versus healthy participants (n = 93).

Results: Linguistic features accurately predicted MMSE scores in individuals with probable or possible AD and successfully identified up to 87% of AD participants versus healthy controls. Statistical models that contained linguistic surprisal (a contextual feature) performed better than those that included only word-level and demographic features. Overall, AD patients with lower MMSE scores produced more empty words, fewer nouns and definite articles, and words that were higher frequency yet more surprising given the previous context.

Conclusion: These results provide novel evidence that metrics related to contextualized word choices, particularly the surprisal of an individual's words, capture variance in degree of cognitive decline in AD.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信