使用声音生物标志物进行老年人群帕金森病早期检测的可解释机器学习。

IF 4.5 2区医学 Q2 GERIATRICS & GERONTOLOGY

Frontiers in Aging Neuroscience Pub Date : 2025-09-05 eCollection Date: 2025-01-01 DOI:10.3389/fnagi.2025.1672971

Bright Egbo, Zhanbota Nigmetolla, Naveed Ahmad Khan, Prashant K Jamwal

{"title":"使用声音生物标志物进行老年人群帕金森病早期检测的可解释机器学习。","authors":"Bright Egbo, Zhanbota Nigmetolla, Naveed Ahmad Khan, Prashant K Jamwal","doi":"10.3389/fnagi.2025.1672971","DOIUrl":null,"url":null,"abstract":"Introduction: Parkinson's Disease (PD) is a progressive neurodegenerative disorder that significantly affects the aging population, creating a growing burden on global health systems. Early detection of PD is clinically challenging due to the gradual and ambiguous onset of symptoms.Methods: This study presents a machine-learning framework for the early identification of PD using non-invasive biomedical voice biomarkers from the UCI Parkinson's dataset. The dataset consists of 195 sustained phonation recordings from 31 participants (23 PD and 8 healthy controls, ages 46-85). The methodology includes subject-level stratified splitting and normalization, along with BorderlineSMOTE to address class imbalance. Initially, an XGBoost model is applied to select the top 10 acoustic features, followed by a Bayesian-optimized XGBoost classifier, with the decision threshold tuned via F1-maximization on validation data.Results: On the held-out test set, the model achieves 98.0% accuracy, 0.97 macro-F1, and 0.991 ROC-AUC. This performance exceeds that of a deep neural network baseline by 4.0 percentage points in accuracy (94.0% to 98.0%), 4.3 percentage points in macro-F1 (92.7% to 97.0%), and 0.050 in AUC (0.941 to 0.991). Compared to a classical SVM, it outperforms by 7.0 percentage points in accuracy (91.0% to 98.0%), 6.5 percentage points in macro-F1 (90.5% to 97.0%), and 0.089 in AUC (0.902 to 0.991).Discussion: Model decisions are elucidated using SHAP, offering global and patient-specific insights into the influential voice features. These findings indicate the feasibility of a non-invasive, scalable, and explainable voice-based tool for early PD screening, highlighting its potential integration into mobile or telehealth diagnostic platforms.","PeriodicalId":12450,"journal":{"name":"Frontiers in Aging Neuroscience","volume":"17 ","pages":"1672971"},"PeriodicalIF":4.5000,"publicationDate":"2025-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12446257/pdf/","citationCount":"0","resultStr":"{\"title\":\"Explainable machine learning for early detection of Parkinson's disease in aging populations using vocal biomarkers.\",\"authors\":\"Bright Egbo, Zhanbota Nigmetolla, Naveed Ahmad Khan, Prashant K Jamwal\",\"doi\":\"10.3389/fnagi.2025.1672971\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Introduction: Parkinson's Disease (PD) is a progressive neurodegenerative disorder that significantly affects the aging population, creating a growing burden on global health systems. Early detection of PD is clinically challenging due to the gradual and ambiguous onset of symptoms.Methods: This study presents a machine-learning framework for the early identification of PD using non-invasive biomedical voice biomarkers from the UCI Parkinson's dataset. The dataset consists of 195 sustained phonation recordings from 31 participants (23 PD and 8 healthy controls, ages 46-85). The methodology includes subject-level stratified splitting and normalization, along with BorderlineSMOTE to address class imbalance. Initially, an XGBoost model is applied to select the top 10 acoustic features, followed by a Bayesian-optimized XGBoost classifier, with the decision threshold tuned via F1-maximization on validation data.Results: On the held-out test set, the model achieves 98.0% accuracy, 0.97 macro-F1, and 0.991 ROC-AUC. This performance exceeds that of a deep neural network baseline by 4.0 percentage points in accuracy (94.0% to 98.0%), 4.3 percentage points in macro-F1 (92.7% to 97.0%), and 0.050 in AUC (0.941 to 0.991). Compared to a classical SVM, it outperforms by 7.0 percentage points in accuracy (91.0% to 98.0%), 6.5 percentage points in macro-F1 (90.5% to 97.0%), and 0.089 in AUC (0.902 to 0.991).Discussion: Model decisions are elucidated using SHAP, offering global and patient-specific insights into the influential voice features. These findings indicate the feasibility of a non-invasive, scalable, and explainable voice-based tool for early PD screening, highlighting its potential integration into mobile or telehealth diagnostic platforms.\",\"PeriodicalId\":12450,\"journal\":{\"name\":\"Frontiers in Aging Neuroscience\",\"volume\":\"17 \",\"pages\":\"1672971\"},\"PeriodicalIF\":4.5000,\"publicationDate\":\"2025-09-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12446257/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Frontiers in Aging Neuroscience\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.3389/fnagi.2025.1672971\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"GERIATRICS & GERONTOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Aging Neuroscience","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3389/fnagi.2025.1672971","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"GERIATRICS & GERONTOLOGY","Score":null,"Total":0}

引用次数: 0

摘要

帕金森氏病（PD）是一种进行性神经退行性疾病，严重影响老龄化人口，给全球卫生系统造成越来越大的负担。由于症状的逐渐和模糊的发作，PD的早期检测在临床上具有挑战性。方法：本研究提出了一个机器学习框架，用于使用来自UCI帕金森数据集的非侵入性生物医学语音生物标志物来早期识别PD。该数据集包括来自31名参与者（23名PD和8名健康对照，年龄在46-85岁）的195个持续发声记录。该方法包括主题级别的分层拆分和规范化，以及用于解决类别不平衡的BorderlineSMOTE。首先，使用XGBoost模型选择前10个声学特征，然后使用贝叶斯优化的XGBoost分类器，并通过对验证数据进行f1最大化来调整决策阈值。结果：在hold -out测试集上，模型的准确率达到98.0%，macro-F1为0.97，ROC-AUC为0.991。该性能在准确率（94.0%至98.0%）、宏观f1（92.7%至97.0%）和AUC（0.941至0.991）方面分别比深度神经网络基线高出4.0个百分点、4.3个百分点和0.050个百分点。与经典SVM相比，它在准确率（91.0%到98.0%）上高出7.0个百分点，在宏观f1（90.5%到97.0%）上高出6.5个百分点，在AUC（0.902到0.991）上高出0.089个百分点。讨论：使用SHAP阐明模型决策，为有影响的语音特征提供全局和患者特定的见解。这些发现表明，一种无创、可扩展、可解释的基于语音的早期PD筛查工具的可行性，突出了其与移动或远程医疗诊断平台集成的潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Explainable machine learning for early detection of Parkinson's disease in aging populations using vocal biomarkers.

查看原文本刊更多论文

Explainable machine learning for early detection of Parkinson's disease in aging populations using vocal biomarkers.

Introduction: Parkinson's Disease (PD) is a progressive neurodegenerative disorder that significantly affects the aging population, creating a growing burden on global health systems. Early detection of PD is clinically challenging due to the gradual and ambiguous onset of symptoms.

Methods: This study presents a machine-learning framework for the early identification of PD using non-invasive biomedical voice biomarkers from the UCI Parkinson's dataset. The dataset consists of 195 sustained phonation recordings from 31 participants (23 PD and 8 healthy controls, ages 46-85). The methodology includes subject-level stratified splitting and normalization, along with BorderlineSMOTE to address class imbalance. Initially, an XGBoost model is applied to select the top 10 acoustic features, followed by a Bayesian-optimized XGBoost classifier, with the decision threshold tuned via F1-maximization on validation data.

Results: On the held-out test set, the model achieves 98.0% accuracy, 0.97 macro-F1, and 0.991 ROC-AUC. This performance exceeds that of a deep neural network baseline by 4.0 percentage points in accuracy (94.0% to 98.0%), 4.3 percentage points in macro-F1 (92.7% to 97.0%), and 0.050 in AUC (0.941 to 0.991). Compared to a classical SVM, it outperforms by 7.0 percentage points in accuracy (91.0% to 98.0%), 6.5 percentage points in macro-F1 (90.5% to 97.0%), and 0.089 in AUC (0.902 to 0.991).

Discussion: Model decisions are elucidated using SHAP, offering global and patient-specific insights into the influential voice features. These findings indicate the feasibility of a non-invasive, scalable, and explainable voice-based tool for early PD screening, highlighting its potential integration into mobile or telehealth diagnostic platforms.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Frontiers in Aging Neuroscience GERIATRICS & GERONTOLOGY-NEUROSCIENCES

CiteScore

6.30

自引率

8.30%

发文量

1426

期刊介绍： Frontiers in Aging Neuroscience is a leading journal in its field, publishing rigorously peer-reviewed research that advances our understanding of the mechanisms of Central Nervous System aging and age-related neural diseases. Specialty Chief Editor Thomas Wisniewski at the New York University School of Medicine is supported by an outstanding Editorial Board of international researchers. This multidisciplinary open-access journal is at the forefront of disseminating and communicating scientific knowledge and impactful discoveries to researchers, academics, clinicians and the public worldwide.