M. Gokiladevi, Sundar Santhoshkumar, Vijayakumar Varadarajan
{"title":"MACHINE LEARNING ALGORITHM SELECTION FOR CHRONIC KIDNEY DISEASE DIAGNOSIS AND CLASSIFICATION","authors":"M. Gokiladevi, Sundar Santhoshkumar, Vijayakumar Varadarajan","doi":"10.22452/mjcs.sp2022no1.8","DOIUrl":null,"url":null,"abstract":"In last decades, chronic kidney disease (CKD) becomes a global health problem that is steadily developing worldwide. It is a chronic illness highly related to increased morbidity and mortality, cardiovascular diseases, and high healthcare cost. Earlier identification and classification of CKD is treated as a major factor in controlling the mortality rate. Data mining (DM) techniques are used for the extraction of hidden details from the clinical and laboratory patient data that is used to aid doctors in enhancing diagnostic accuracy. Recently, machine learning (ML) techniques are commonly employed for the prediction and classification of diseases in healthcare sector. With this motivation, this study examines the performance of different ML algorithms to diagnose CKD at the earlier stages. The proposed model involves data pre-processing in two stages such as missing value replacement and data transformation. Besides, a set of five ML based classification models are involved such as support vector machine (SVM), random forest (RF), logistic regression (LR), K-nearest neighbor (KNN), and decision tree (DT). For investigating the performance of the different ML models, a benchmark CKD dataset from UCI repository is employed and the results are examined under different aspects. Among the different classifiers, the RF model has accomplished superior results with the maximum precision of 0.99, recall of 0.99, and F-score of 0.99 with a minimal error rate of 0.012.","PeriodicalId":49894,"journal":{"name":"Malaysian Journal of Computer Science","volume":" ","pages":""},"PeriodicalIF":1.1000,"publicationDate":"2022-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Malaysian Journal of Computer Science","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.22452/mjcs.sp2022no1.8","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 2
Abstract
In last decades, chronic kidney disease (CKD) becomes a global health problem that is steadily developing worldwide. It is a chronic illness highly related to increased morbidity and mortality, cardiovascular diseases, and high healthcare cost. Earlier identification and classification of CKD is treated as a major factor in controlling the mortality rate. Data mining (DM) techniques are used for the extraction of hidden details from the clinical and laboratory patient data that is used to aid doctors in enhancing diagnostic accuracy. Recently, machine learning (ML) techniques are commonly employed for the prediction and classification of diseases in healthcare sector. With this motivation, this study examines the performance of different ML algorithms to diagnose CKD at the earlier stages. The proposed model involves data pre-processing in two stages such as missing value replacement and data transformation. Besides, a set of five ML based classification models are involved such as support vector machine (SVM), random forest (RF), logistic regression (LR), K-nearest neighbor (KNN), and decision tree (DT). For investigating the performance of the different ML models, a benchmark CKD dataset from UCI repository is employed and the results are examined under different aspects. Among the different classifiers, the RF model has accomplished superior results with the maximum precision of 0.99, recall of 0.99, and F-score of 0.99 with a minimal error rate of 0.012.
期刊介绍:
The Malaysian Journal of Computer Science (ISSN 0127-9084) is published four times a year in January, April, July and October by the Faculty of Computer Science and Information Technology, University of Malaya, since 1985. Over the years, the journal has gained popularity and the number of paper submissions has increased steadily. The rigorous reviews from the referees have helped in ensuring that the high standard of the journal is maintained. The objectives are to promote exchange of information and knowledge in research work, new inventions/developments of Computer Science and on the use of Information Technology towards the structuring of an information-rich society and to assist the academic staff from local and foreign universities, business and industrial sectors, government departments and academic institutions on publishing research results and studies in Computer Science and Information Technology through a scholarly publication. The journal is being indexed and abstracted by Clarivate Analytics'' Web of Science and Elsevier''s Scopus