Efficient heart disease prediction-based on optimal feature selection using DFCSS and classification by improved Elman-SFO

IF 16.4 1区 化学 Q1 CHEMISTRY, MULTIDISCIPLINARY
Jaishri Wankhede, Magesh Kumar, Palaniappan Sambandam
{"title":"Efficient heart disease prediction-based on optimal feature selection using DFCSS and classification by improved Elman-SFO","authors":"Jaishri Wankhede,&nbsp;Magesh Kumar,&nbsp;Palaniappan Sambandam","doi":"10.1049/iet-syb.2020.0041","DOIUrl":null,"url":null,"abstract":"<div>\n <p>Prediction of cardiovascular disease (CVD) is a critical challenge in the area of clinical data analysis. In this study, an efficient heart disease prediction is developed based on optimal feature selection. Initially, the data pre-processing process is performed using data cleaning, data transformation, missing values imputation, and data normalisation. Then the decision function-based chaotic salp swarm (DFCSS) algorithm is used to select the optimal features in the feature selection process. Then the chosen attributes are given to the improved Elman neural network (IENN) for data classification. Here, the sailfish optimisation (SFO) algorithm is used to compute the optimal weight value of IENN. The combination of DFCSS–IENN-based SFO (IESFO) algorithm effectively predicts heart disease. The proposed (DFCSS–IESFO) approach is implemented in the Python environment using two different datasets such as the University of California Irvine (UCI) Cleveland heart disease dataset and CVD dataset. The simulation results proved that the proposed scheme achieved a high-classification accuracy of 98.7% for the CVD dataset and 98% for the UCI dataset compared to other classifiers, such as support vector machine, K-nearest neighbour, Elman neural network, Gaussian Naive Bayes, logistic regression, random forest, and decision tree.</p>\n </div>","PeriodicalId":1,"journal":{"name":"Accounts of Chemical Research","volume":null,"pages":null},"PeriodicalIF":16.4000,"publicationDate":"2020-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8687167/pdf/SYB2-14-380.pdf","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Accounts of Chemical Research","FirstCategoryId":"99","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/iet-syb.2020.0041","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 11

Abstract

Prediction of cardiovascular disease (CVD) is a critical challenge in the area of clinical data analysis. In this study, an efficient heart disease prediction is developed based on optimal feature selection. Initially, the data pre-processing process is performed using data cleaning, data transformation, missing values imputation, and data normalisation. Then the decision function-based chaotic salp swarm (DFCSS) algorithm is used to select the optimal features in the feature selection process. Then the chosen attributes are given to the improved Elman neural network (IENN) for data classification. Here, the sailfish optimisation (SFO) algorithm is used to compute the optimal weight value of IENN. The combination of DFCSS–IENN-based SFO (IESFO) algorithm effectively predicts heart disease. The proposed (DFCSS–IESFO) approach is implemented in the Python environment using two different datasets such as the University of California Irvine (UCI) Cleveland heart disease dataset and CVD dataset. The simulation results proved that the proposed scheme achieved a high-classification accuracy of 98.7% for the CVD dataset and 98% for the UCI dataset compared to other classifiers, such as support vector machine, K-nearest neighbour, Elman neural network, Gaussian Naive Bayes, logistic regression, random forest, and decision tree.

Abstract Image

基于DFCSS的最优特征选择和改进Elman-SFO分类的心脏病预测
心血管疾病(CVD)的预测是临床数据分析领域的一个关键挑战。本研究提出了一种基于最优特征选择的心脏病预测方法。最初,数据预处理过程使用数据清理、数据转换、缺失值输入和数据规范化来执行。然后在特征选择过程中,采用基于决策函数的混沌萨尔普群(DFCSS)算法来选择最优特征。然后将选择的属性交给改进的Elman神经网络(IENN)进行数据分类。本文采用旗鱼优化(sailfish optimization, SFO)算法计算IENN的最优权值。结合基于dfcss - iann的SFO (IESFO)算法可有效预测心脏病。提出的(DFCSS-IESFO)方法在Python环境中使用两个不同的数据集(如加州大学欧文分校(UCI)克利夫兰心脏病数据集和心血管疾病数据集)实现。仿真结果表明,与支持向量机、k近邻、Elman神经网络、高斯朴素贝叶斯、逻辑回归、随机森林和决策树等分类器相比,该方法对CVD数据集的分类准确率达到98.7%,对UCI数据集的分类准确率达到98%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Accounts of Chemical Research
Accounts of Chemical Research 化学-化学综合
CiteScore
31.40
自引率
1.10%
发文量
312
审稿时长
2 months
期刊介绍: Accounts of Chemical Research presents short, concise and critical articles offering easy-to-read overviews of basic research and applications in all areas of chemistry and biochemistry. These short reviews focus on research from the author’s own laboratory and are designed to teach the reader about a research project. In addition, Accounts of Chemical Research publishes commentaries that give an informed opinion on a current research problem. Special Issues online are devoted to a single topic of unusual activity and significance. Accounts of Chemical Research replaces the traditional article abstract with an article "Conspectus." These entries synopsize the research affording the reader a closer look at the content and significance of an article. Through this provision of a more detailed description of the article contents, the Conspectus enhances the article's discoverability by search engines and the exposure for the research.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信