使用自动语音识别的语言特征进行可解释的阿尔茨海默病检测。

IF 2.2 4区医学 Q3 CLINICAL NEUROLOGY

Dementia and Geriatric Cognitive Disorders Pub Date : 2023-01-01 Epub Date: 2023-07-11 DOI:10.1159/000531818

Lijuan Tang, Zhenglin Zhang, Feifan Feng, Li-Zhuang Yang, Hai Li

{"title":"使用自动语音识别的语言特征进行可解释的阿尔茨海默病检测。","authors":"Lijuan Tang, Zhenglin Zhang, Feifan Feng, Li-Zhuang Yang, Hai Li","doi":"10.1159/000531818","DOIUrl":null,"url":null,"abstract":"Introduction: Alzheimer's disease (AD) is the most prevalent type of dementia and can cause abnormal cognitive function and progressive loss of essential life skills. Early screening is thus necessary for the prevention and intervention of AD. Speech dysfunction is an early onset symptom of AD patients. Recent studies have demonstrated the promise of automated acoustic assessment using acoustic or linguistic features extracted from speech. However, most previous studies have relied on manual transcription of text to extract linguistic features, which weakens the efficiency of automated assessment. The present study thus investigates the effectiveness of automatic speech recognition (ASR) in building an end-to-end automated speech analysis model for AD detection.Methods: We implemented three publicly available ASR engines and compared the classification performance using the ADReSS-IS2020 dataset. Besides, the SHapley Additive exPlanations algorithm was then used to identify critical features that contributed most to model performance.Results: Three automatic transcription tools obtained mean word error rate texts of 32%, 43%, and 40%, respectively. These automated texts achieved similar or even better results than manual texts in model performance for detecting dementia, achieving classification accuracies of 89.58%, 83.33%, and 81.25%, respectively.Conclusion: Our best model, using ensemble learning, is comparable to the state-of-the-art manual transcription-based methods, suggesting the possibility of an end-to-end medical assistance system for AD detection with ASR engines. Moreover, the critical linguistic features might provide insight into further studies on the mechanism of AD.","PeriodicalId":11126,"journal":{"name":"Dementia and Geriatric Cognitive Disorders","volume":" ","pages":"240-248"},"PeriodicalIF":2.2000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Explainable Alzheimer's Disease Detection Using Linguistic Features from Automatic Speech Recognition.\",\"authors\":\"Lijuan Tang, Zhenglin Zhang, Feifan Feng, Li-Zhuang Yang, Hai Li\",\"doi\":\"10.1159/000531818\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Introduction: Alzheimer's disease (AD) is the most prevalent type of dementia and can cause abnormal cognitive function and progressive loss of essential life skills. Early screening is thus necessary for the prevention and intervention of AD. Speech dysfunction is an early onset symptom of AD patients. Recent studies have demonstrated the promise of automated acoustic assessment using acoustic or linguistic features extracted from speech. However, most previous studies have relied on manual transcription of text to extract linguistic features, which weakens the efficiency of automated assessment. The present study thus investigates the effectiveness of automatic speech recognition (ASR) in building an end-to-end automated speech analysis model for AD detection.Methods: We implemented three publicly available ASR engines and compared the classification performance using the ADReSS-IS2020 dataset. Besides, the SHapley Additive exPlanations algorithm was then used to identify critical features that contributed most to model performance.Results: Three automatic transcription tools obtained mean word error rate texts of 32%, 43%, and 40%, respectively. These automated texts achieved similar or even better results than manual texts in model performance for detecting dementia, achieving classification accuracies of 89.58%, 83.33%, and 81.25%, respectively.Conclusion: Our best model, using ensemble learning, is comparable to the state-of-the-art manual transcription-based methods, suggesting the possibility of an end-to-end medical assistance system for AD detection with ASR engines. Moreover, the critical linguistic features might provide insight into further studies on the mechanism of AD.\",\"PeriodicalId\":11126,\"journal\":{\"name\":\"Dementia and Geriatric Cognitive Disorders\",\"volume\":\" \",\"pages\":\"240-248\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2023-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Dementia and Geriatric Cognitive Disorders\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1159/000531818\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2023/7/11 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q3\",\"JCRName\":\"CLINICAL NEUROLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Dementia and Geriatric Cognitive Disorders","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1159/000531818","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/7/11 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}

引用次数: 1

摘要

引言：阿尔茨海默病（AD）是最常见的痴呆类型，可导致认知功能异常和基本生活技能的逐渐丧失。因此，早期筛查对AD的预防和干预是必要的。言语功能障碍是AD患者的早期症状。最近的研究已经证明了使用从语音中提取的声学或语言特征进行自动声学评估的前景。然而，以前的大多数研究都依赖于文本的手动转录来提取语言特征，这削弱了自动评估的效率。因此，本研究调查了自动语音识别（ASR）在构建用于AD检测的端到端自动语音分析模型方面的有效性。方法：我们实现了三个公开可用的ASR引擎，并使用ADReSS-IS2020数据集比较了分类性能。此外，还使用SHapley加性exPlanations算法来识别对模型性能贡献最大的关键特征。结果：三个自动转录工具获得的文本平均单词错误率分别为32%、43%和40%。在检测痴呆症的模型性能方面，这些自动化文本取得了与手动文本相似甚至更好的结果，分类准确率分别为89.58%、83.33%和81.25%。结论：我们使用集成学习的最佳模型与最先进的基于手动转录的方法相当，这表明有可能使用ASR引擎建立用于AD检测的端到端医疗辅助系统。此外，批判性语言特征可能为进一步研究AD的机制提供见解。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Explainable Alzheimer's Disease Detection Using Linguistic Features from Automatic Speech Recognition.

Introduction: Alzheimer's disease (AD) is the most prevalent type of dementia and can cause abnormal cognitive function and progressive loss of essential life skills. Early screening is thus necessary for the prevention and intervention of AD. Speech dysfunction is an early onset symptom of AD patients. Recent studies have demonstrated the promise of automated acoustic assessment using acoustic or linguistic features extracted from speech. However, most previous studies have relied on manual transcription of text to extract linguistic features, which weakens the efficiency of automated assessment. The present study thus investigates the effectiveness of automatic speech recognition (ASR) in building an end-to-end automated speech analysis model for AD detection.

Methods: We implemented three publicly available ASR engines and compared the classification performance using the ADReSS-IS2020 dataset. Besides, the SHapley Additive exPlanations algorithm was then used to identify critical features that contributed most to model performance.

Results: Three automatic transcription tools obtained mean word error rate texts of 32%, 43%, and 40%, respectively. These automated texts achieved similar or even better results than manual texts in model performance for detecting dementia, achieving classification accuracies of 89.58%, 83.33%, and 81.25%, respectively.

Conclusion: Our best model, using ensemble learning, is comparable to the state-of-the-art manual transcription-based methods, suggesting the possibility of an end-to-end medical assistance system for AD detection with ASR engines. Moreover, the critical linguistic features might provide insight into further studies on the mechanism of AD.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Dementia and Geriatric Cognitive Disorders 医学-精神病学

CiteScore

4.70

自引率

0.00%

发文量

审稿时长

2 months

期刊介绍： As a unique forum devoted exclusively to the study of cognitive dysfunction, ''Dementia and Geriatric Cognitive Disorders'' concentrates on Alzheimer’s and Parkinson’s disease, Huntington’s chorea and other neurodegenerative diseases. The journal draws from diverse related research disciplines such as psychogeriatrics, neuropsychology, clinical neurology, morphology, physiology, genetic molecular biology, pathology, biochemistry, immunology, pharmacology and pharmaceutics. Strong emphasis is placed on the publication of research findings from animal studies which are complemented by clinical and therapeutic experience to give an overall appreciation of the field.