Deep Learning Based Decision Support Framework for Cardiovascular Disease Prediction

2021 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE) Pub Date : 2021-12-08 DOI:10.1109/CSDE53843.2021.9718459

Nitten Singh Rajjliwal, G. Chetty

{"title":"Deep Learning Based Decision Support Framework for Cardiovascular Disease Prediction","authors":"Nitten Singh Rajjliwal, G. Chetty","doi":"10.1109/CSDE53843.2021.9718459","DOIUrl":null,"url":null,"abstract":"In this paper we propose a novel decision support framework based on deep learning for cardiovascular disease prediction. The proposed framework based on an innovative stacked dense neural layer and convolution neural network cascade architecture, addresses the significant imbalance in class distribution in CVD event detection task. The experimental evaluation of the proposed model was done on the NHANES super-dataset, obtained by fusion of different subsets of publicly NHANES (National Health and Nutrition Examination Survey) data for prediction of cardiovascular disease. Many machines and deep learning models have been proposed in the literature for CVD event detection. However, they assume balanced class distribution between positive and negative disease classes. For clinical settings, there is significant class imbalance, with few positive class samples as compared to abundant samples from normal or control class. Hence most of the traditional machine and deep learning models are vulnerable to class imbalance, even after using class-specific adjustment of weights (well established method for handling class imbalance) and can lead to poor performance for the minority class detection. The proposed model based on stacked-Dense-CNN cascade architecture is robust and resilient to the class imbalance and has better overall detection accuracy. The first stage of the stacked-Dense-CNN cascade consists of an optimal feature learning stage, comprising a LASSO (least absolute shrinkage and selection) and majority voting step, for extraction of significant and homogenized features. The second stage use of a novel stacked-Dense-CNN cascade model and a novel model development protocol involving an unique train-test dataset partitioning strategy. Also, by using a specific training routine per epoch, similar to the simulated annealing approach, it was possible to achieve enhanced detection performance, particularly for detection of minority class, and robustness to class imbalance. The experimental evaluation of the novel stacked-Dense-CNN cascade model on a super dataset obtained by fusing multiple data subsets of publicly available NHANES data, resulted in an accuracy of 81.8% accuracy for negative CVD cases (majority class), and 85% for the positive CVD cases (minority class), an improved performance as compared to previously proposed research approaches for imbalanced clinical data settings.","PeriodicalId":166950,"journal":{"name":"2021 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSDE53843.2021.9718459","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

In this paper we propose a novel decision support framework based on deep learning for cardiovascular disease prediction. The proposed framework based on an innovative stacked dense neural layer and convolution neural network cascade architecture, addresses the significant imbalance in class distribution in CVD event detection task. The experimental evaluation of the proposed model was done on the NHANES super-dataset, obtained by fusion of different subsets of publicly NHANES (National Health and Nutrition Examination Survey) data for prediction of cardiovascular disease. Many machines and deep learning models have been proposed in the literature for CVD event detection. However, they assume balanced class distribution between positive and negative disease classes. For clinical settings, there is significant class imbalance, with few positive class samples as compared to abundant samples from normal or control class. Hence most of the traditional machine and deep learning models are vulnerable to class imbalance, even after using class-specific adjustment of weights (well established method for handling class imbalance) and can lead to poor performance for the minority class detection. The proposed model based on stacked-Dense-CNN cascade architecture is robust and resilient to the class imbalance and has better overall detection accuracy. The first stage of the stacked-Dense-CNN cascade consists of an optimal feature learning stage, comprising a LASSO (least absolute shrinkage and selection) and majority voting step, for extraction of significant and homogenized features. The second stage use of a novel stacked-Dense-CNN cascade model and a novel model development protocol involving an unique train-test dataset partitioning strategy. Also, by using a specific training routine per epoch, similar to the simulated annealing approach, it was possible to achieve enhanced detection performance, particularly for detection of minority class, and robustness to class imbalance. The experimental evaluation of the novel stacked-Dense-CNN cascade model on a super dataset obtained by fusing multiple data subsets of publicly available NHANES data, resulted in an accuracy of 81.8% accuracy for negative CVD cases (majority class), and 85% for the positive CVD cases (minority class), an improved performance as compared to previously proposed research approaches for imbalanced clinical data settings.

查看原文本刊更多论文

基于深度学习的心血管疾病预测决策支持框架

本文提出了一种基于深度学习的心血管疾病预测决策支持框架。该框架基于一种新颖的堆叠密集神经层和卷积神经网络级联结构，解决了CVD事件检测任务中类分布严重不平衡的问题。该模型在NHANES超级数据集上进行了实验评估，该数据集是通过融合NHANES(国家健康与营养调查)公开数据的不同子集获得的，用于预测心血管疾病。文献中已经提出了许多用于CVD事件检测的机器和深度学习模型。然而，它们假设阳性和阴性疾病类别之间的阶级分布是平衡的。在临床环境中，存在明显的类别不平衡，与正常或对照类别的大量样本相比，阳性类别样本很少。因此，大多数传统的机器和深度学习模型容易受到类不平衡的影响，即使在使用特定于类的权重调整(处理类不平衡的成熟方法)之后，也可能导致少数类检测的性能不佳。该模型基于stacking - dense - cnn级联结构，对类不平衡具有鲁棒性和弹性，具有更好的整体检测精度。堆叠- dense - cnn级联的第一阶段由最优特征学习阶段组成，包括LASSO(最小绝对收缩和选择)和多数投票步骤，用于提取重要和均匀化的特征。第二阶段使用了一种新的堆叠-密集- cnn级联模型和一种新的模型开发协议，该协议涉及一种独特的训练-测试数据集划分策略。此外，通过使用每个历元的特定训练例程，类似于模拟退火方法，可以实现增强的检测性能，特别是对于少数类的检测，以及对类不平衡的鲁棒性。在融合公开可用的NHANES数据的多个数据子集获得的超级数据集上，对新型堆叠- dense - cnn级联模型进行了实验评估，结果表明，阴性CVD病例(多数类别)的准确率为81.8%，阳性CVD病例(少数类别)的准确率为85%，与之前提出的针对不平衡临床数据设置的研究方法相比，性能有所提高。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE)

自引率

0.00%

发文量