Experimental analysis of filtering-based feature selection techniques for fetal health classification

Q3 Engineering
I. Jebadurai, G. Paulraj, Jebaveerasingh Jebadurai, S. Silas
{"title":"Experimental analysis of filtering-based feature selection techniques for fetal health classification","authors":"I. Jebadurai, G. Paulraj, Jebaveerasingh Jebadurai, S. Silas","doi":"10.2298/sjee2202207j","DOIUrl":null,"url":null,"abstract":"Machine learning techniques enable computers to acquire intelligence through learning. Trained machines can carry out various tasks, such as prediction, classification, clustering, and recommendation, within a wide variety of applications. Classification is a supervised learning technique that can be improved using feature selection techniques such as filtering, wrapping, and embedding. This paper explores the impact of filtering-based feature selection techniques on classification methods, and focuses on an analysis of correlationbased filtering techniques based on Pearson, Spearman, and Kendall rank correlation. Similarly, we explore the impacts of using statistical filtering techniques such as mutual information, chi-squared score, the ANOVA univariate test, and the univariate ROC-AUC. These filtering techniques are evaluated by implementing them with the k-nearest neighbor, support vector machine, decision tree, and Gaussian na?ve Bayes classification methods. Our experiments were carried out using a fetal heart rate dataset, and the performance of each combination of methods was measured based on precision, recall, F1-score, and accuracy. An analysis of the experimental results showed that the performance metrics for the Gaussian na?ve Bayes and k-nearest neighbor methods were improved by 3% through the use of the statistical feature selection technique, and a 4% improvement was observed for the decision tree and support vector machine methods using a correlation-based filtering technique. Of the statistical feature selection techniques, ANOVA and ROC-AUC were the best as they improved the accuracy by 92%; compared to the other correlation techniques, the Spearman correlation coefficient gave the best results, as it also improved the accuracy by 92%.","PeriodicalId":37704,"journal":{"name":"Serbian Journal of Electrical Engineering","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Serbian Journal of Electrical Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2298/sjee2202207j","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Engineering","Score":null,"Total":0}
引用次数: 4

Abstract

Machine learning techniques enable computers to acquire intelligence through learning. Trained machines can carry out various tasks, such as prediction, classification, clustering, and recommendation, within a wide variety of applications. Classification is a supervised learning technique that can be improved using feature selection techniques such as filtering, wrapping, and embedding. This paper explores the impact of filtering-based feature selection techniques on classification methods, and focuses on an analysis of correlationbased filtering techniques based on Pearson, Spearman, and Kendall rank correlation. Similarly, we explore the impacts of using statistical filtering techniques such as mutual information, chi-squared score, the ANOVA univariate test, and the univariate ROC-AUC. These filtering techniques are evaluated by implementing them with the k-nearest neighbor, support vector machine, decision tree, and Gaussian na?ve Bayes classification methods. Our experiments were carried out using a fetal heart rate dataset, and the performance of each combination of methods was measured based on precision, recall, F1-score, and accuracy. An analysis of the experimental results showed that the performance metrics for the Gaussian na?ve Bayes and k-nearest neighbor methods were improved by 3% through the use of the statistical feature selection technique, and a 4% improvement was observed for the decision tree and support vector machine methods using a correlation-based filtering technique. Of the statistical feature selection techniques, ANOVA and ROC-AUC were the best as they improved the accuracy by 92%; compared to the other correlation techniques, the Spearman correlation coefficient gave the best results, as it also improved the accuracy by 92%.
基于过滤的胎儿健康分类特征选择技术的实验分析
机器学习技术使计算机能够通过学习获得智能。经过训练的机器可以在各种各样的应用程序中执行各种任务,例如预测、分类、聚类和推荐。分类是一种监督学习技术,可以使用过滤、包装和嵌入等特征选择技术来改进。本文探讨了基于过滤的特征选择技术对分类方法的影响,并重点分析了基于Pearson、Spearman和Kendall秩相关的基于关联的过滤技术。同样,我们探讨了使用统计过滤技术的影响,如互信息、卡方评分、单变量方差分析和单变量ROC-AUC。这些过滤技术通过使用k近邻、支持向量机、决策树和高斯na?5贝叶斯分类方法。我们的实验使用胎儿心率数据集进行,并根据精确度、召回率、f1评分和准确性来衡量每种方法组合的性能。对实验结果的分析表明,高斯滤波的性能指标是可靠的。贝叶斯和k近邻方法通过使用统计特征选择技术提高了3%,决策树和支持向量机方法使用基于相关的过滤技术提高了4%。在统计特征选择技术中,ANOVA和ROC-AUC的准确率提高了92%,是最好的;与其他相关技术相比,Spearman相关系数给出了最好的结果,因为它也将准确率提高了92%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Serbian Journal of Electrical Engineering
Serbian Journal of Electrical Engineering Energy-Energy Engineering and Power Technology
CiteScore
1.30
自引率
0.00%
发文量
16
审稿时长
25 weeks
期刊介绍: The main aims of the Journal are to publish peer review papers giving results of the fundamental and applied research in the field of electrical engineering. The Journal covers a wide scope of problems in the following scientific fields: Applied and Theoretical Electromagnetics, Instrumentation and Measurement, Power Engineering, Power Systems, Electrical Machines, Electrical Drives, Electronics, Telecommunications, Computer Engineering, Automatic Control and Systems, Mechatronics, Electrical Materials, Information Technologies, Engineering Mathematics, etc.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信