Efficient heart disease prediction-based on optimal feature selection using DFCSS and classification by improved Elman-SFO

IF 1.9 4区 生物学 Q4 CELL BIOLOGY
Jaishri Wankhede, Magesh Kumar, Palaniappan Sambandam
{"title":"Efficient heart disease prediction-based on optimal feature selection using DFCSS and classification by improved Elman-SFO","authors":"Jaishri Wankhede,&nbsp;Magesh Kumar,&nbsp;Palaniappan Sambandam","doi":"10.1049/iet-syb.2020.0041","DOIUrl":null,"url":null,"abstract":"<div>\n <p>Prediction of cardiovascular disease (CVD) is a critical challenge in the area of clinical data analysis. In this study, an efficient heart disease prediction is developed based on optimal feature selection. Initially, the data pre-processing process is performed using data cleaning, data transformation, missing values imputation, and data normalisation. Then the decision function-based chaotic salp swarm (DFCSS) algorithm is used to select the optimal features in the feature selection process. Then the chosen attributes are given to the improved Elman neural network (IENN) for data classification. Here, the sailfish optimisation (SFO) algorithm is used to compute the optimal weight value of IENN. The combination of DFCSS–IENN-based SFO (IESFO) algorithm effectively predicts heart disease. The proposed (DFCSS–IESFO) approach is implemented in the Python environment using two different datasets such as the University of California Irvine (UCI) Cleveland heart disease dataset and CVD dataset. The simulation results proved that the proposed scheme achieved a high-classification accuracy of 98.7% for the CVD dataset and 98% for the UCI dataset compared to other classifiers, such as support vector machine, K-nearest neighbour, Elman neural network, Gaussian Naive Bayes, logistic regression, random forest, and decision tree.</p>\n </div>","PeriodicalId":50379,"journal":{"name":"IET Systems Biology","volume":null,"pages":null},"PeriodicalIF":1.9000,"publicationDate":"2020-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8687167/pdf/SYB2-14-380.pdf","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IET Systems Biology","FirstCategoryId":"99","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/iet-syb.2020.0041","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"CELL BIOLOGY","Score":null,"Total":0}
引用次数: 11

Abstract

Prediction of cardiovascular disease (CVD) is a critical challenge in the area of clinical data analysis. In this study, an efficient heart disease prediction is developed based on optimal feature selection. Initially, the data pre-processing process is performed using data cleaning, data transformation, missing values imputation, and data normalisation. Then the decision function-based chaotic salp swarm (DFCSS) algorithm is used to select the optimal features in the feature selection process. Then the chosen attributes are given to the improved Elman neural network (IENN) for data classification. Here, the sailfish optimisation (SFO) algorithm is used to compute the optimal weight value of IENN. The combination of DFCSS–IENN-based SFO (IESFO) algorithm effectively predicts heart disease. The proposed (DFCSS–IESFO) approach is implemented in the Python environment using two different datasets such as the University of California Irvine (UCI) Cleveland heart disease dataset and CVD dataset. The simulation results proved that the proposed scheme achieved a high-classification accuracy of 98.7% for the CVD dataset and 98% for the UCI dataset compared to other classifiers, such as support vector machine, K-nearest neighbour, Elman neural network, Gaussian Naive Bayes, logistic regression, random forest, and decision tree.

Abstract Image

基于DFCSS的最优特征选择和改进Elman-SFO分类的心脏病预测
心血管疾病(CVD)的预测是临床数据分析领域的一个关键挑战。本研究提出了一种基于最优特征选择的心脏病预测方法。最初,数据预处理过程使用数据清理、数据转换、缺失值输入和数据规范化来执行。然后在特征选择过程中,采用基于决策函数的混沌萨尔普群(DFCSS)算法来选择最优特征。然后将选择的属性交给改进的Elman神经网络(IENN)进行数据分类。本文采用旗鱼优化(sailfish optimization, SFO)算法计算IENN的最优权值。结合基于dfcss - iann的SFO (IESFO)算法可有效预测心脏病。提出的(DFCSS-IESFO)方法在Python环境中使用两个不同的数据集(如加州大学欧文分校(UCI)克利夫兰心脏病数据集和心血管疾病数据集)实现。仿真结果表明,与支持向量机、k近邻、Elman神经网络、高斯朴素贝叶斯、逻辑回归、随机森林和决策树等分类器相比,该方法对CVD数据集的分类准确率达到98.7%,对UCI数据集的分类准确率达到98%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
IET Systems Biology
IET Systems Biology 生物-数学与计算生物学
CiteScore
4.20
自引率
4.30%
发文量
17
审稿时长
>12 weeks
期刊介绍: IET Systems Biology covers intra- and inter-cellular dynamics, using systems- and signal-oriented approaches. Papers that analyse genomic data in order to identify variables and basic relationships between them are considered if the results provide a basis for mathematical modelling and simulation of cellular dynamics. Manuscripts on molecular and cell biological studies are encouraged if the aim is a systems approach to dynamic interactions within and between cells. The scope includes the following topics: Genomics, transcriptomics, proteomics, metabolomics, cells, tissue and the physiome; molecular and cellular interaction, gene, cell and protein function; networks and pathways; metabolism and cell signalling; dynamics, regulation and control; systems, signals, and information; experimental data analysis; mathematical modelling, simulation and theoretical analysis; biological modelling, simulation, prediction and control; methodologies, databases, tools and algorithms for modelling and simulation; modelling, analysis and control of biological networks; synthetic biology and bioengineering based on systems biology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信