A Study of Dimensionality Reduction’s Influence on Heart Disease Prediction

2021 12th International Conference on Information, Intelligence, Systems & Applications (IISA) Pub Date : 2021-07-12 DOI:10.1109/IISA52424.2021.9555550

Gaoshuai Wang, Fabrice Lauri, A. Hassani

{"title":"A Study of Dimensionality Reduction’s Influence on Heart Disease Prediction","authors":"Gaoshuai Wang, Fabrice Lauri, A. Hassani","doi":"10.1109/IISA52424.2021.9555550","DOIUrl":null,"url":null,"abstract":"Heart disease is a serious threat to human life due to its suddenness and ponderance. It’s urgent and meaningful to build a diagnosis system to detect heart disease earlier and accurately. In the field of medicine, doctors have summarized lots of experience on heart disease diagnosis. Duo to a large number of samples and attributes, the work done by the human is not efficient. And, computer-aided disease diagnosis has shown its advantages. Many researchers have applied machine learning methods to heart disease detection. For pursuing better performance, dimensionality reduction methods are often used for selecting key features or accelerating the processing speed. In this research, we investigate the influence of dimensionality reduction by using PCA and LDA methods on the machine learning methods’ prediction. PCA and LDA represent two famous dimensionality reduction, unsupervised and supervised methods. The results display that the performance of PCA is better than LDA’s evaluated by several metrics. Additionally, PCA indeed promotes many different methods’ prediction effects. There is an optimal amount of features when using PCA. It seems that the dataset with more features is easy to obtain better results. Otherwise, the dataset itself also has a significant influence on prediction result even their structures are the same. The dimensionality reduction will influence the time consumption of machine learning methods. Finally, we reveal that complex models are not always better than simple ones.","PeriodicalId":437496,"journal":{"name":"2021 12th International Conference on Information, Intelligence, Systems & Applications (IISA)","volume":"493 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 12th International Conference on Information, Intelligence, Systems & Applications (IISA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IISA52424.2021.9555550","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

Abstract

Heart disease is a serious threat to human life due to its suddenness and ponderance. It’s urgent and meaningful to build a diagnosis system to detect heart disease earlier and accurately. In the field of medicine, doctors have summarized lots of experience on heart disease diagnosis. Duo to a large number of samples and attributes, the work done by the human is not efficient. And, computer-aided disease diagnosis has shown its advantages. Many researchers have applied machine learning methods to heart disease detection. For pursuing better performance, dimensionality reduction methods are often used for selecting key features or accelerating the processing speed. In this research, we investigate the influence of dimensionality reduction by using PCA and LDA methods on the machine learning methods’ prediction. PCA and LDA represent two famous dimensionality reduction, unsupervised and supervised methods. The results display that the performance of PCA is better than LDA’s evaluated by several metrics. Additionally, PCA indeed promotes many different methods’ prediction effects. There is an optimal amount of features when using PCA. It seems that the dataset with more features is easy to obtain better results. Otherwise, the dataset itself also has a significant influence on prediction result even their structures are the same. The dimensionality reduction will influence the time consumption of machine learning methods. Finally, we reveal that complex models are not always better than simple ones.

查看原文本刊更多论文

降维对心脏病预测影响的研究

心脏病因其突发性和严重性严重威胁着人类的生命。建立一个早期、准确地发现心脏病的诊断系统是迫切而有意义的。在医学领域，医生们对心脏病的诊断总结了许多经验。对于大量的样本和属性，人工完成的工作效率不高。计算机辅助疾病诊断已显示出其优势。许多研究人员已经将机器学习方法应用于心脏病检测。为了追求更好的性能，通常使用降维方法来选择关键特征或加快处理速度。在本研究中，我们研究了使用PCA和LDA方法进行降维对机器学习方法预测的影响。PCA和LDA代表了两种著名的降维方法:无监督降维和有监督降维。结果表明，主成分分析法的性能优于LDA分析法。此外，PCA确实促进了许多不同方法的预测效果。当使用PCA时，有一个最优的特征量。似乎特征越多的数据集越容易获得更好的结果。否则，即使数据集本身的结构相同，对预测结果也有显著的影响。降维会影响机器学习方法的时间消耗。最后，我们发现复杂模型并不总是比简单模型好。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 12th International Conference on Information, Intelligence, Systems & Applications (IISA)

自引率

0.00%

发文量