Hua Huo, Shupei Jiao, Dongfang Li, Lan Ma, Ningya Xu
{"title":"Efficient quantification of Parkinson's disease severity using augmented time-series data.","authors":"Hua Huo, Shupei Jiao, Dongfang Li, Lan Ma, Ningya Xu","doi":"10.1371/journal.pone.0319826","DOIUrl":null,"url":null,"abstract":"<p><p>The diagnosis of Parkinson's disease relies heavily on the subjective assessment of physicians, which depends on their individual experience and training, potentially leading to inconsistent diagnostic results. Therefore, developing an objective and efficient diagnostic method is essential to improve the accuracy and timeliness of Parkinson's disease diagnosis. In this study, we utilized the PhysioNet dataset, a time-series dataset comprising data from 93 Parkinson's patients and 73 healthy individuals. The dataset contains vertical ground reaction forces recorded from 16 sensors (8 per foot) during a 2-minute test at a sampling rate of 100 Hz. To address challenges such as limited dataset size, high labeling noise, and high intra-class variability, we performed data preprocessing and applied various data augmentation techniques, including jittering, scaling, rotation, permutation, magnitude warping, time warping, cropping, and linear residuals. These methods were evaluated using one-dimensional-convolutional neural network (1D-ConvNet) and one-dimensional Transformer networks. By conducting 10-fold cross-validation, we observed significant improvements in classification performance. The best data augmentation strategy achieved 90.8% accuracy, 92.0% precision, 91.0% recall, and a 91.0% F1 score in assessing disease severity. These results highlight the importance of selecting appropriate data augmentation techniques for time-series data to improve model generalization and diagnostic reliability, while also offering new insights for researchers working with sensor device data. Our results demonstrate that data-enhanced methods can significantly boost the performance of machine-learning models in the field of Parkinson's disease diagnosis.</p>","PeriodicalId":20189,"journal":{"name":"PLoS ONE","volume":"20 4","pages":"e0319826"},"PeriodicalIF":2.9000,"publicationDate":"2025-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"PLoS ONE","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1371/journal.pone.0319826","RegionNum":3,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
The diagnosis of Parkinson's disease relies heavily on the subjective assessment of physicians, which depends on their individual experience and training, potentially leading to inconsistent diagnostic results. Therefore, developing an objective and efficient diagnostic method is essential to improve the accuracy and timeliness of Parkinson's disease diagnosis. In this study, we utilized the PhysioNet dataset, a time-series dataset comprising data from 93 Parkinson's patients and 73 healthy individuals. The dataset contains vertical ground reaction forces recorded from 16 sensors (8 per foot) during a 2-minute test at a sampling rate of 100 Hz. To address challenges such as limited dataset size, high labeling noise, and high intra-class variability, we performed data preprocessing and applied various data augmentation techniques, including jittering, scaling, rotation, permutation, magnitude warping, time warping, cropping, and linear residuals. These methods were evaluated using one-dimensional-convolutional neural network (1D-ConvNet) and one-dimensional Transformer networks. By conducting 10-fold cross-validation, we observed significant improvements in classification performance. The best data augmentation strategy achieved 90.8% accuracy, 92.0% precision, 91.0% recall, and a 91.0% F1 score in assessing disease severity. These results highlight the importance of selecting appropriate data augmentation techniques for time-series data to improve model generalization and diagnostic reliability, while also offering new insights for researchers working with sensor device data. Our results demonstrate that data-enhanced methods can significantly boost the performance of machine-learning models in the field of Parkinson's disease diagnosis.
期刊介绍:
PLOS ONE is an international, peer-reviewed, open-access, online publication. PLOS ONE welcomes reports on primary research from any scientific discipline. It provides:
* Open-access—freely accessible online, authors retain copyright
* Fast publication times
* Peer review by expert, practicing researchers
* Post-publication tools to indicate quality and impact
* Community-based dialogue on articles
* Worldwide media coverage