Simona Aresta, Petronilla Battista, Cinzia Palmirotta, Serena Tagliente, Gianvito Lagravinese, Paola Santacesaria, Allegra Benzini, Davide Mongelli, Brigida Minafra, Christian Lunetta, Adolfo M. García, Christian Salvatore
{"title":"Digital phenotyping of Parkinson’s disease via natural language processing","authors":"Simona Aresta, Petronilla Battista, Cinzia Palmirotta, Serena Tagliente, Gianvito Lagravinese, Paola Santacesaria, Allegra Benzini, Davide Mongelli, Brigida Minafra, Christian Lunetta, Adolfo M. García, Christian Salvatore","doi":"10.1038/s41531-025-01050-8","DOIUrl":null,"url":null,"abstract":"<p>Frontostriatal degeneration in Parkinson’s disease (PD) is associated with language deficits, which can be identified using natural language processing, a remarkable tool for digital-phenotyping. Current evidence is mostly blind to the disorder’s cognitive phenotypes. We validated an AI-driven approach to capture digital language markers of PD with and without mild cognitive impairment (PD-MCI, PD-nMCI) relative to healthy controls (HCs). Analyzing the connected speech of participants, we extracted linguistic features with CLAN software. Classification was performed using SVM and RFE. Discrimination between PD and HCs reached an AUC of 77%, with even better results for subgroup analyses (AUC: 85% PD-nMCI vs. HCs; 83% PD-MCI vs. HCs; 75% PD-nMCI vs. PD-MCI). Key linguistic features included retracing, action verb, utterance error, and verbless-utterance ratios. Despite the small sample size, which may limit statistical power and generalizability, this study highlights the foundational potential of linguistic digital markers for early diagnosis and phenotyping of PD.</p>","PeriodicalId":19706,"journal":{"name":"NPJ Parkinson's Disease","volume":"644 1","pages":""},"PeriodicalIF":8.2000,"publicationDate":"2025-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"NPJ Parkinson's Disease","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1038/s41531-025-01050-8","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"NEUROSCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Frontostriatal degeneration in Parkinson’s disease (PD) is associated with language deficits, which can be identified using natural language processing, a remarkable tool for digital-phenotyping. Current evidence is mostly blind to the disorder’s cognitive phenotypes. We validated an AI-driven approach to capture digital language markers of PD with and without mild cognitive impairment (PD-MCI, PD-nMCI) relative to healthy controls (HCs). Analyzing the connected speech of participants, we extracted linguistic features with CLAN software. Classification was performed using SVM and RFE. Discrimination between PD and HCs reached an AUC of 77%, with even better results for subgroup analyses (AUC: 85% PD-nMCI vs. HCs; 83% PD-MCI vs. HCs; 75% PD-nMCI vs. PD-MCI). Key linguistic features included retracing, action verb, utterance error, and verbless-utterance ratios. Despite the small sample size, which may limit statistical power and generalizability, this study highlights the foundational potential of linguistic digital markers for early diagnosis and phenotyping of PD.
期刊介绍:
npj Parkinson's Disease is a comprehensive open access journal that covers a wide range of research areas related to Parkinson's disease. It publishes original studies in basic science, translational research, and clinical investigations. The journal is dedicated to advancing our understanding of Parkinson's disease by exploring various aspects such as anatomy, etiology, genetics, cellular and molecular physiology, neurophysiology, epidemiology, and therapeutic development. By providing free and immediate access to the scientific and Parkinson's disease community, npj Parkinson's Disease promotes collaboration and knowledge sharing among researchers and healthcare professionals.