Parkinson’s Disease Stage Classification with Gait Analysis using Machine Learning Techniques and SMOTE-based Approach for Class Imbalance Problem

Aishwarya Balakrishnan, Jeevan Medikonda, Pramod K. Namboothiri, Manikandan Natarajan
{"title":"Parkinson’s Disease Stage Classification with Gait Analysis using Machine Learning Techniques and SMOTE-based Approach for Class Imbalance Problem","authors":"Aishwarya Balakrishnan, Jeevan Medikonda, Pramod K. Namboothiri, Manikandan Natarajan","doi":"10.1109/DISCOVER55800.2022.9974754","DOIUrl":null,"url":null,"abstract":"High variability in symptom severity and progression rate roots the need for a diverse training dataset, to build an efficient Parkinson’s Disease (PD) severity prediction model. The Physionet database comprises gait signals of PD subjects belonging to various H&Y score-based severity levels but forms an imbalanced dataset. A dataset is said to be imbalanced if the representation of the classification categories within a dataset is not equal. The severity of misclassifying abnormal cases as normal is high and thus is a matter of concern. This paper shows how a technique called Synthetic Minority Oversampling Technique (SMOTE) deals with the class imbalance problem in PD stage-wise classification by improving minority class recognition. The method is validated by quantifying the dissimilarity among samples generated showing the non-existence of overlapping or replication. Spatiotemporal gait parameters along with their regularity and symmetry features are the attributes considered. Classifiers are trained with balanced & imbalanced datasets and their predictive accuracy attributes are compared. Results show an improvement in determining the minority class by the model trained with the balanced dataset, thus improving the generalizability of the model.","PeriodicalId":264177,"journal":{"name":"2022 International Conference on Distributed Computing, VLSI, Electrical Circuits and Robotics ( DISCOVER)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Distributed Computing, VLSI, Electrical Circuits and Robotics ( DISCOVER)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DISCOVER55800.2022.9974754","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

High variability in symptom severity and progression rate roots the need for a diverse training dataset, to build an efficient Parkinson’s Disease (PD) severity prediction model. The Physionet database comprises gait signals of PD subjects belonging to various H&Y score-based severity levels but forms an imbalanced dataset. A dataset is said to be imbalanced if the representation of the classification categories within a dataset is not equal. The severity of misclassifying abnormal cases as normal is high and thus is a matter of concern. This paper shows how a technique called Synthetic Minority Oversampling Technique (SMOTE) deals with the class imbalance problem in PD stage-wise classification by improving minority class recognition. The method is validated by quantifying the dissimilarity among samples generated showing the non-existence of overlapping or replication. Spatiotemporal gait parameters along with their regularity and symmetry features are the attributes considered. Classifiers are trained with balanced & imbalanced datasets and their predictive accuracy attributes are compared. Results show an improvement in determining the minority class by the model trained with the balanced dataset, thus improving the generalizability of the model.
基于机器学习技术的步态分析与帕金森病分期分类和基于smote的分类不平衡问题
症状严重程度和进展率的高度可变性导致需要多样化的训练数据集,以建立有效的帕金森病(PD)严重程度预测模型。Physionet数据库包含PD受试者的步态信号,属于各种基于H&Y评分的严重程度,但形成了一个不平衡的数据集。如果数据集中分类类别的表示不相等,则数据集被称为不平衡的。将异常病例误分类为正常病例的严重程度很高,因此值得关注。本文介绍了一种称为合成少数派过采样技术(SMOTE)的方法,该方法通过改进少数派类识别来解决PD阶段分类中的类不平衡问题。通过量化生成的样品之间的不相似性来验证该方法,表明不存在重叠或复制。时空步态参数及其规律性和对称性特征是考虑的属性。用平衡和不平衡数据集训练分类器,并比较它们的预测精度属性。结果表明,使用平衡数据集训练的模型在确定少数类别方面有很大的改进,从而提高了模型的泛化能力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信