A cross-language speech model for detection of Parkinson's disease.

IF 3.2 4区 医学 Q2 CLINICAL NEUROLOGY
Journal of Neural Transmission Pub Date : 2025-04-01 Epub Date: 2024-12-30 DOI:10.1007/s00702-024-02874-z
Wee Shin Lim, Shu-I Chiu, Pei-Ling Peng, Jyh-Shing Roger Jang, Sol-Hee Lee, Chin-Hsien Lin, Han-Joon Kim
{"title":"A cross-language speech model for detection of Parkinson's disease.","authors":"Wee Shin Lim, Shu-I Chiu, Pei-Ling Peng, Jyh-Shing Roger Jang, Sol-Hee Lee, Chin-Hsien Lin, Han-Joon Kim","doi":"10.1007/s00702-024-02874-z","DOIUrl":null,"url":null,"abstract":"<p><p>Speech change is a biometric marker for Parkinson's disease (PD). However, evaluating speech variability across diverse languages is challenging. We aimed to develop a cross-language algorithm differentiating between PD patients and healthy controls using a Taiwanese and Korean speech data set. We recruited 299 healthy controls and 347 patients with PD from Taiwan and Korea. Participants with PD underwent smartphone-based speech recordings during the \"on\" phase. Each Korean participant performed various speech texts, while the Taiwanese participant read a standardized, fixed-length article. Korean short-speech (≦15 syllables) and long-speech (> 15 syllables) recordings were combined with the Taiwanese speech dataset. The merged dataset was split into a training set (controls vs. early-stage PD) and a validation set (controls vs. advanced-stage PD) to evaluate the model's effectiveness in differentiating PD patients from controls across languages based on speech length. Numerous acoustic and linguistic speech features were extracted and combined with machine learning algorithms to distinguish PD patients from controls. The area under the receiver operating characteristic (AUROC) curve was calculated to assess diagnostic performance. Random forest and AdaBoost classifiers showed an AUROC 0.82 for distinguishing patients with early-stage PD from controls. In the validation cohort, the random forest algorithm maintained this value (0.90) for discriminating advanced-stage PD patients. The model showed superior performance in the combined language cohort (AUROC 0.90) than either the Korean (AUROC 0.87) or Taiwanese (AUROC 0.88) cohorts individually. However, with another merged speech data set of short-speech recordings < 25 characters, the diagnostic performance to identify early-stage PD patients from controls dropped to 0.72 and showed a further limited ability to discriminate advanced-stage patients. Leveraging multifaceted speech features, including both acoustic and linguistic characteristics, could aid in distinguishing PD patients from healthy individuals, even across different languages.</p>","PeriodicalId":16579,"journal":{"name":"Journal of Neural Transmission","volume":" ","pages":"579-590"},"PeriodicalIF":3.2000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Neural Transmission","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s00702-024-02874-z","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/30 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Speech change is a biometric marker for Parkinson's disease (PD). However, evaluating speech variability across diverse languages is challenging. We aimed to develop a cross-language algorithm differentiating between PD patients and healthy controls using a Taiwanese and Korean speech data set. We recruited 299 healthy controls and 347 patients with PD from Taiwan and Korea. Participants with PD underwent smartphone-based speech recordings during the "on" phase. Each Korean participant performed various speech texts, while the Taiwanese participant read a standardized, fixed-length article. Korean short-speech (≦15 syllables) and long-speech (> 15 syllables) recordings were combined with the Taiwanese speech dataset. The merged dataset was split into a training set (controls vs. early-stage PD) and a validation set (controls vs. advanced-stage PD) to evaluate the model's effectiveness in differentiating PD patients from controls across languages based on speech length. Numerous acoustic and linguistic speech features were extracted and combined with machine learning algorithms to distinguish PD patients from controls. The area under the receiver operating characteristic (AUROC) curve was calculated to assess diagnostic performance. Random forest and AdaBoost classifiers showed an AUROC 0.82 for distinguishing patients with early-stage PD from controls. In the validation cohort, the random forest algorithm maintained this value (0.90) for discriminating advanced-stage PD patients. The model showed superior performance in the combined language cohort (AUROC 0.90) than either the Korean (AUROC 0.87) or Taiwanese (AUROC 0.88) cohorts individually. However, with another merged speech data set of short-speech recordings < 25 characters, the diagnostic performance to identify early-stage PD patients from controls dropped to 0.72 and showed a further limited ability to discriminate advanced-stage patients. Leveraging multifaceted speech features, including both acoustic and linguistic characteristics, could aid in distinguishing PD patients from healthy individuals, even across different languages.

一种检测帕金森病的跨语言语音模型。
言语变化是帕金森病(PD)的一种生物特征标志。然而,评估不同语言之间的语音变异性是具有挑战性的。我们的目标是使用台湾和韩国语音数据集开发一种跨语言算法来区分PD患者和健康对照。我们从台湾和韩国招募了299名健康对照和347名PD患者。PD患者在“开启”阶段接受了基于智能手机的语音录音。每位韩国参与者都表演了不同的演讲文本,而台湾参与者则阅读了一篇标准的、固定长度的文章。韩国语短词(≦15音节)和长词(bbb15音节)记录与台湾语语音数据集相结合。合并后的数据集被分成一个训练集(对照与早期PD)和一个验证集(对照与晚期PD),以评估该模型在基于语言长度区分PD患者和对照组的有效性。提取大量的声学和语言语音特征,并结合机器学习算法来区分PD患者和对照组。计算受试者工作特征(AUROC)曲线下的面积来评估诊断效果。随机森林和AdaBoost分类器显示,区分早期PD患者和对照组的AUROC为0.82。在验证队列中,随机森林算法在区分晚期PD患者时保持该值(0.90)。该模型在联合语言队列(AUROC为0.90)中的表现优于单独的韩语队列(AUROC为0.87)或台湾队列(AUROC为0.88)。然而,与另一个合并的语音数据集的短语音记录
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Neural Transmission
Journal of Neural Transmission 医学-临床神经学
CiteScore
7.20
自引率
3.00%
发文量
112
审稿时长
2 months
期刊介绍: The investigation of basic mechanisms involved in the pathogenesis of neurological and psychiatric disorders has undoubtedly deepened our knowledge of these types of disorders. The impact of basic neurosciences on the understanding of the pathophysiology of the brain will further increase due to important developments such as the emergence of more specific psychoactive compounds and new technologies. The Journal of Neural Transmission aims to establish an interface between basic sciences and clinical neurology and psychiatry. It intends to put a special emphasis on translational publications of the newest developments in the field from all disciplines of the neural sciences that relate to a better understanding and treatment of neurological and psychiatric disorders.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信