Aishwarya Balakrishnan, Jeevan Medikonda, Pramod K. Namboothiri, Manikandan Natarajan
{"title":"Parkinson’s Disease Stage Classification with Gait Analysis using Machine Learning Techniques and SMOTE-based Approach for Class Imbalance Problem","authors":"Aishwarya Balakrishnan, Jeevan Medikonda, Pramod K. Namboothiri, Manikandan Natarajan","doi":"10.1109/DISCOVER55800.2022.9974754","DOIUrl":null,"url":null,"abstract":"High variability in symptom severity and progression rate roots the need for a diverse training dataset, to build an efficient Parkinson’s Disease (PD) severity prediction model. The Physionet database comprises gait signals of PD subjects belonging to various H&Y score-based severity levels but forms an imbalanced dataset. A dataset is said to be imbalanced if the representation of the classification categories within a dataset is not equal. The severity of misclassifying abnormal cases as normal is high and thus is a matter of concern. This paper shows how a technique called Synthetic Minority Oversampling Technique (SMOTE) deals with the class imbalance problem in PD stage-wise classification by improving minority class recognition. The method is validated by quantifying the dissimilarity among samples generated showing the non-existence of overlapping or replication. Spatiotemporal gait parameters along with their regularity and symmetry features are the attributes considered. Classifiers are trained with balanced & imbalanced datasets and their predictive accuracy attributes are compared. Results show an improvement in determining the minority class by the model trained with the balanced dataset, thus improving the generalizability of the model.","PeriodicalId":264177,"journal":{"name":"2022 International Conference on Distributed Computing, VLSI, Electrical Circuits and Robotics ( DISCOVER)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Distributed Computing, VLSI, Electrical Circuits and Robotics ( DISCOVER)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DISCOVER55800.2022.9974754","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
High variability in symptom severity and progression rate roots the need for a diverse training dataset, to build an efficient Parkinson’s Disease (PD) severity prediction model. The Physionet database comprises gait signals of PD subjects belonging to various H&Y score-based severity levels but forms an imbalanced dataset. A dataset is said to be imbalanced if the representation of the classification categories within a dataset is not equal. The severity of misclassifying abnormal cases as normal is high and thus is a matter of concern. This paper shows how a technique called Synthetic Minority Oversampling Technique (SMOTE) deals with the class imbalance problem in PD stage-wise classification by improving minority class recognition. The method is validated by quantifying the dissimilarity among samples generated showing the non-existence of overlapping or replication. Spatiotemporal gait parameters along with their regularity and symmetry features are the attributes considered. Classifiers are trained with balanced & imbalanced datasets and their predictive accuracy attributes are compared. Results show an improvement in determining the minority class by the model trained with the balanced dataset, thus improving the generalizability of the model.