M. M. Hossain, Md. Rana Ahmed, M. Hasan, M. Sultana, K. Fatema
{"title":"Liable Characteristics Measure and Anticipate the Diabetes Disease Using Machine Learning Tools","authors":"M. M. Hossain, Md. Rana Ahmed, M. Hasan, M. Sultana, K. Fatema","doi":"10.28924/ada/stat.3.2","DOIUrl":null,"url":null,"abstract":"Diabetes is a cardiovascular disease. It is not only an epidemic in Bangladesh but also in the whole world that is increasing rapidly. At an early period of human life, machine learning techniques are used to predict diabetes datasets. In our research paper, we use the Pima diabetes dataset from the Kaggle UCI machine learning data repository. For diabetic patients and doctors, machine learning techniques are both cost-effective and time-saving. We apply KNN, Nave Bayes, Random forest, Support vector machine, Simple logistic, and J48 to Pima datasets. Besides these algorithms, we may develop an ensemble (Vote) hybrid model with WEKA software by combining individual methods that provide the best performance and accuracy. Also, try to make a comparison among all machine learning tool’s accuracy and performance with the proposed ensemble model.","PeriodicalId":153849,"journal":{"name":"European Journal of Statistics","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Statistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.28924/ada/stat.3.2","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Diabetes is a cardiovascular disease. It is not only an epidemic in Bangladesh but also in the whole world that is increasing rapidly. At an early period of human life, machine learning techniques are used to predict diabetes datasets. In our research paper, we use the Pima diabetes dataset from the Kaggle UCI machine learning data repository. For diabetic patients and doctors, machine learning techniques are both cost-effective and time-saving. We apply KNN, Nave Bayes, Random forest, Support vector machine, Simple logistic, and J48 to Pima datasets. Besides these algorithms, we may develop an ensemble (Vote) hybrid model with WEKA software by combining individual methods that provide the best performance and accuracy. Also, try to make a comparison among all machine learning tool’s accuracy and performance with the proposed ensemble model.