Simona Aresta, Petronilla Battista, Cinzia Palmirotta, Serena Tagliente, Gianvito Lagravinese, Paola Santacesaria, Allegra Benzini, Davide Mongelli, Brigida Minafra, Christian Lunetta, Adolfo M. García, Christian Salvatore
{"title":"通过自然语言处理的帕金森病的数字表型","authors":"Simona Aresta, Petronilla Battista, Cinzia Palmirotta, Serena Tagliente, Gianvito Lagravinese, Paola Santacesaria, Allegra Benzini, Davide Mongelli, Brigida Minafra, Christian Lunetta, Adolfo M. García, Christian Salvatore","doi":"10.1038/s41531-025-01050-8","DOIUrl":null,"url":null,"abstract":"<p>Frontostriatal degeneration in Parkinson’s disease (PD) is associated with language deficits, which can be identified using natural language processing, a remarkable tool for digital-phenotyping. Current evidence is mostly blind to the disorder’s cognitive phenotypes. We validated an AI-driven approach to capture digital language markers of PD with and without mild cognitive impairment (PD-MCI, PD-nMCI) relative to healthy controls (HCs). Analyzing the connected speech of participants, we extracted linguistic features with CLAN software. Classification was performed using SVM and RFE. Discrimination between PD and HCs reached an AUC of 77%, with even better results for subgroup analyses (AUC: 85% PD-nMCI vs. HCs; 83% PD-MCI vs. HCs; 75% PD-nMCI vs. PD-MCI). Key linguistic features included retracing, action verb, utterance error, and verbless-utterance ratios. Despite the small sample size, which may limit statistical power and generalizability, this study highlights the foundational potential of linguistic digital markers for early diagnosis and phenotyping of PD.</p>","PeriodicalId":19706,"journal":{"name":"NPJ Parkinson's Disease","volume":"644 1","pages":""},"PeriodicalIF":8.2000,"publicationDate":"2025-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Digital phenotyping of Parkinson’s disease via natural language processing\",\"authors\":\"Simona Aresta, Petronilla Battista, Cinzia Palmirotta, Serena Tagliente, Gianvito Lagravinese, Paola Santacesaria, Allegra Benzini, Davide Mongelli, Brigida Minafra, Christian Lunetta, Adolfo M. García, Christian Salvatore\",\"doi\":\"10.1038/s41531-025-01050-8\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Frontostriatal degeneration in Parkinson’s disease (PD) is associated with language deficits, which can be identified using natural language processing, a remarkable tool for digital-phenotyping. Current evidence is mostly blind to the disorder’s cognitive phenotypes. We validated an AI-driven approach to capture digital language markers of PD with and without mild cognitive impairment (PD-MCI, PD-nMCI) relative to healthy controls (HCs). Analyzing the connected speech of participants, we extracted linguistic features with CLAN software. Classification was performed using SVM and RFE. Discrimination between PD and HCs reached an AUC of 77%, with even better results for subgroup analyses (AUC: 85% PD-nMCI vs. HCs; 83% PD-MCI vs. HCs; 75% PD-nMCI vs. PD-MCI). Key linguistic features included retracing, action verb, utterance error, and verbless-utterance ratios. Despite the small sample size, which may limit statistical power and generalizability, this study highlights the foundational potential of linguistic digital markers for early diagnosis and phenotyping of PD.</p>\",\"PeriodicalId\":19706,\"journal\":{\"name\":\"NPJ Parkinson's Disease\",\"volume\":\"644 1\",\"pages\":\"\"},\"PeriodicalIF\":8.2000,\"publicationDate\":\"2025-06-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"NPJ Parkinson's Disease\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1038/s41531-025-01050-8\",\"RegionNum\":1,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"NEUROSCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"NPJ Parkinson's Disease","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1038/s41531-025-01050-8","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"NEUROSCIENCES","Score":null,"Total":0}
引用次数: 0
摘要
帕金森病(PD)的额纹状体变性与语言缺陷有关,这可以通过自然语言处理来识别,这是一种重要的数字表型分析工具。目前的证据大多对这种疾病的认知表型视而不见。我们验证了一种人工智能驱动的方法,用于捕获与健康对照(hc)相比,有或没有轻度认知障碍的PD (PD- mci, PD- nmci)的数字语言标记。通过分析参与者的连接语音,利用CLAN软件提取语言特征。使用SVM和RFE进行分类。PD和hcc之间的鉴别AUC达到77%,亚组分析的结果甚至更好(AUC: 85% PD- nmci vs. hcc;83% PD-MCI vs. hcc;75% PD-nMCI vs PD-MCI)。主要的语言特征包括回溯、动作动词、发音错误和无动词发音比例。尽管样本量小,可能会限制统计能力和普遍性,但本研究强调了语言数字标记在PD早期诊断和表型分析中的基础潜力。
Digital phenotyping of Parkinson’s disease via natural language processing
Frontostriatal degeneration in Parkinson’s disease (PD) is associated with language deficits, which can be identified using natural language processing, a remarkable tool for digital-phenotyping. Current evidence is mostly blind to the disorder’s cognitive phenotypes. We validated an AI-driven approach to capture digital language markers of PD with and without mild cognitive impairment (PD-MCI, PD-nMCI) relative to healthy controls (HCs). Analyzing the connected speech of participants, we extracted linguistic features with CLAN software. Classification was performed using SVM and RFE. Discrimination between PD and HCs reached an AUC of 77%, with even better results for subgroup analyses (AUC: 85% PD-nMCI vs. HCs; 83% PD-MCI vs. HCs; 75% PD-nMCI vs. PD-MCI). Key linguistic features included retracing, action verb, utterance error, and verbless-utterance ratios. Despite the small sample size, which may limit statistical power and generalizability, this study highlights the foundational potential of linguistic digital markers for early diagnosis and phenotyping of PD.
期刊介绍:
npj Parkinson's Disease is a comprehensive open access journal that covers a wide range of research areas related to Parkinson's disease. It publishes original studies in basic science, translational research, and clinical investigations. The journal is dedicated to advancing our understanding of Parkinson's disease by exploring various aspects such as anatomy, etiology, genetics, cellular and molecular physiology, neurophysiology, epidemiology, and therapeutic development. By providing free and immediate access to the scientific and Parkinson's disease community, npj Parkinson's Disease promotes collaboration and knowledge sharing among researchers and healthcare professionals.