A survey on Machine Learning Classifiers and Big data for Accurate and Reliable Heart Disease Pre-diagnosis

International Journal of Advanced Research in Big Data Management System Pub Date : 2019-11-30 DOI:10.21742/ijarbms.2019.3.2.04

Srikanth Meda

{"title":"A survey on Machine Learning Classifiers and Big data for Accurate and Reliable Heart Disease Pre-diagnosis","authors":"Srikanth Meda","doi":"10.21742/ijarbms.2019.3.2.04","DOIUrl":null,"url":null,"abstract":"Since a decade, emergence of interdisciplinary computer technologies changed the pace of medical diagnosis systems by insisting up-to-date intelligence and supervision. These intellectual systems predict the future health problems by processing the current health information of patients, which helps in prevention of diseases rather than cure. Although the medical diagnosis systems are adequate intelligent in disease diagnosis, but they are still suffering in pre-diagnosis of diseases due to the complexity in processing of huge medical datasets. Recently introduced Data Mining techniques with Big Data processing environment expanded the horizons of medical diagnosis systems to process the high velocity medical data sets to diagnose the occurrence of diseases early. Today’s medical diagnosis systems, which are utilizing different data mining techniques like Decision Trees (DT), Support Vector Machines (SVM), Naïve Bayes (NB), Fuzzy Logics and K-Nearest Neighbor (KNN), are suffering from uncertainty, imprecision and complexity in processing. In this paper we are proposing a cross reference methodology to improvise the reliability and precision of diagnostic results and utilizing big data tools to diminish the complexity in processing huge sets of medical data. Most popular data mining techniques, which are participating in medical data processing with high accuracy are selected and cross referenced by our proposed framework to overcome uncertainty and imprecision. In order to process the high velocity medical datasets with several data mining techniques, this frame work outsources the data processing business to Apache Hadoop environment. Experiments on Cleveland medical dataset proved that the proposed cross reference methodology framework recorded high precision, recall and accuracy in results than its counterparts.","PeriodicalId":377126,"journal":{"name":"International Journal of Advanced Research in Big Data Management System","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Advanced Research in Big Data Management System","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21742/ijarbms.2019.3.2.04","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Since a decade, emergence of interdisciplinary computer technologies changed the pace of medical diagnosis systems by insisting up-to-date intelligence and supervision. These intellectual systems predict the future health problems by processing the current health information of patients, which helps in prevention of diseases rather than cure. Although the medical diagnosis systems are adequate intelligent in disease diagnosis, but they are still suffering in pre-diagnosis of diseases due to the complexity in processing of huge medical datasets. Recently introduced Data Mining techniques with Big Data processing environment expanded the horizons of medical diagnosis systems to process the high velocity medical data sets to diagnose the occurrence of diseases early. Today’s medical diagnosis systems, which are utilizing different data mining techniques like Decision Trees (DT), Support Vector Machines (SVM), Naïve Bayes (NB), Fuzzy Logics and K-Nearest Neighbor (KNN), are suffering from uncertainty, imprecision and complexity in processing. In this paper we are proposing a cross reference methodology to improvise the reliability and precision of diagnostic results and utilizing big data tools to diminish the complexity in processing huge sets of medical data. Most popular data mining techniques, which are participating in medical data processing with high accuracy are selected and cross referenced by our proposed framework to overcome uncertainty and imprecision. In order to process the high velocity medical datasets with several data mining techniques, this frame work outsources the data processing business to Apache Hadoop environment. Experiments on Cleveland medical dataset proved that the proposed cross reference methodology framework recorded high precision, recall and accuracy in results than its counterparts.

查看原文本刊更多论文

机器学习分类器与大数据在心脏病准确可靠预诊断中的应用研究

近十年来，跨学科计算机技术的出现改变了医疗诊断系统的步伐，坚持了最新的智能和监控。这些智能系统通过处理患者当前的健康信息来预测未来的健康问题，这有助于预防而不是治疗疾病。虽然医疗诊断系统在疾病诊断方面具有足够的智能，但由于庞大的医疗数据集处理的复杂性，在疾病的预诊断方面仍然存在问题。最近引入的具有大数据处理环境的数据挖掘技术扩展了医疗诊断系统的视野，可以处理高速的医疗数据集来早期诊断疾病的发生。今天的医疗诊断系统使用不同的数据挖掘技术，如决策树(DT)，支持向量机(SVM)， Naïve贝叶斯(NB)，模糊逻辑和k -最近邻(KNN)，在处理过程中存在不确定性，不精确和复杂性。在本文中，我们提出了一种交叉参考方法来提高诊断结果的可靠性和准确性，并利用大数据工具来减少处理大量医疗数据的复杂性。本文提出的框架选择了目前医学数据处理中最流行的、精度较高的数据挖掘技术，并对其进行了交叉引用，以克服不确定性和不精确性。为了利用多种数据挖掘技术对高速医疗数据集进行处理，该框架将数据处理业务外包给Apache Hadoop环境。在Cleveland医学数据集上的实验证明，所提出的交叉参考方法框架在结果上具有较高的精密度、查全率和准确率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Journal of Advanced Research in Big Data Management System

自引率

0.00%

发文量