Regularization based discriminative feature pattern selection for the classification of Parkinson cases using machine learning

IF 1.2 Q3 Computer Science
Kamalakannan Kaliyan, Anand Ganesan
{"title":"Regularization based discriminative feature pattern selection for the classification of Parkinson cases using machine learning","authors":"Kamalakannan Kaliyan, Anand Ganesan","doi":"10.1515/bams-2021-0064","DOIUrl":null,"url":null,"abstract":"Abstract Objectives This paper focuses on developing a regularization-based feature selection approach to select the most effective attributes from the Parkinson’s speech dataset. Parkinson’s disease is a medical condition that progresses as the dopamine-producing nerve cells are affected. Early diagnosis often reduces the effect on the individuals, minimizes the advancement over time. In recent times, intelligent computational models are used in many complex cases to diagnose a clinical condition with high precision. These models are intended to find meaningful representation from the data to diagnose the disease. Machine learning acts as a tool, gears up the model learning process through a mathematical baseline. But, not in all cases, machine learning will be demanded to perform optimally. It comes with a few constraints, mainly the representation of the data. The learning models expect a clean, noise-free input, which in-turns produces better discriminative patterns over different categories of classes. Methods The proposed model identified five candidate features as predictors. This feature subset is trained with different varieties of supervised classifiers to trace out the best-performing model. Results The results are validated through accuracy, precision, recall, and receiver’s operational characteristic curves. The proposed regularization- based feature selection model outperformed the benchmark algorithms by attaining 100% accuracy on most of the classifiers, other than linear discriminant analysis (99.90%) and naïve Bayes (99.51%). Conclusions This paper exhibits the need for intelligent models to analyze complex data patterns to assist medical practitioners in better disease diagnosis. The results exhibit that the regularization methods find the best features based on their importance score, which improved the model performance over other feature selection methods.","PeriodicalId":42620,"journal":{"name":"Bio-Algorithms and Med-Systems","volume":null,"pages":null},"PeriodicalIF":1.2000,"publicationDate":"2021-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bio-Algorithms and Med-Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1515/bams-2021-0064","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Computer Science","Score":null,"Total":0}
引用次数: 1

Abstract

Abstract Objectives This paper focuses on developing a regularization-based feature selection approach to select the most effective attributes from the Parkinson’s speech dataset. Parkinson’s disease is a medical condition that progresses as the dopamine-producing nerve cells are affected. Early diagnosis often reduces the effect on the individuals, minimizes the advancement over time. In recent times, intelligent computational models are used in many complex cases to diagnose a clinical condition with high precision. These models are intended to find meaningful representation from the data to diagnose the disease. Machine learning acts as a tool, gears up the model learning process through a mathematical baseline. But, not in all cases, machine learning will be demanded to perform optimally. It comes with a few constraints, mainly the representation of the data. The learning models expect a clean, noise-free input, which in-turns produces better discriminative patterns over different categories of classes. Methods The proposed model identified five candidate features as predictors. This feature subset is trained with different varieties of supervised classifiers to trace out the best-performing model. Results The results are validated through accuracy, precision, recall, and receiver’s operational characteristic curves. The proposed regularization- based feature selection model outperformed the benchmark algorithms by attaining 100% accuracy on most of the classifiers, other than linear discriminant analysis (99.90%) and naïve Bayes (99.51%). Conclusions This paper exhibits the need for intelligent models to analyze complex data patterns to assist medical practitioners in better disease diagnosis. The results exhibit that the regularization methods find the best features based on their importance score, which improved the model performance over other feature selection methods.
基于正则化的判别特征模式选择用于机器学习的帕金森病例分类
摘要目的本文致力于开发一种基于正则化的特征选择方法,从帕金森氏语音数据集中选择最有效的属性。帕金森病是一种随着产生多巴胺的神经细胞受到影响而发展的疾病。早期诊断通常会减少对个体的影响,最大限度地减少随着时间的推移的进展。近年来,智能计算模型被用于许多复杂的病例,以高精度诊断临床状况。这些模型旨在从数据中找到有意义的表示,以诊断疾病。机器学习作为一种工具,通过数学基线来加速模型学习过程。但是,并不是在所有情况下,机器学习都需要达到最佳性能。它附带了一些约束,主要是数据的表示。学习模型期望一个干净、无噪声的输入,从而在不同类别的类中产生更好的判别模式。方法所提出的模型确定了五个候选特征作为预测因子。该特征子集使用不同种类的监督分类器进行训练,以找出性能最佳的模型。结果通过准确度、精密度、召回率和受试者的操作特性曲线对结果进行了验证。除了线性判别分析(99.90%)和朴素贝叶斯(99.51%)之外,所提出的基于正则化的特征选择模型在大多数分类器上都达到了100%的准确率,优于基准算法。结论本文表明需要智能模型来分析复杂的数据模式,以帮助医生更好地进行疾病诊断。结果表明,正则化方法根据重要度得分找到最佳特征,与其他特征选择方法相比,这提高了模型的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Bio-Algorithms and Med-Systems
Bio-Algorithms and Med-Systems MATHEMATICAL & COMPUTATIONAL BIOLOGY-
CiteScore
3.80
自引率
0.00%
发文量
3
期刊介绍: The journal Bio-Algorithms and Med-Systems (BAMS), edited by the Jagiellonian University Medical College, provides a forum for the exchange of information in the interdisciplinary fields of computational methods applied in medicine, presenting new algorithms and databases that allows the progress in collaborations between medicine, informatics, physics, and biochemistry. Projects linking specialists representing these disciplines are welcome to be published in this Journal. Articles in BAMS are published in English. Topics Bioinformatics Systems biology Telemedicine E-Learning in Medicine Patient''s electronic record Image processing Medical databases.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信