{"title":"A study on using Python vs Weka on dialysis data analysis","authors":"J. Mitrpanont, Wudhichart Sawangphol, Thanita Vithantirawat, Sinattaya Paengkaew, Prameyuda Suwannasing, Atthapan Daramas, Yi-Cheng Chen","doi":"10.1109/INCIT.2017.8257883","DOIUrl":null,"url":null,"abstract":"Health data has been drastically increasing in capacity and variety. Due to large and complex collection of datasets, it is difficult to process data using traditional data processing techniques. Machine Learning techniques, such as KNN and Naïve Bayes, have been used. Python and Weka are tools that are widely used in the field of data analytics. Therefore, this paper gives the comprehensive comparison between both tools together with some machine learning algorithms on data analytic of Dialysis Dataset. The results show that using Python provides the better performance in term of correct/incorrect instances, precision, and recall.","PeriodicalId":405827,"journal":{"name":"2017 2nd International Conference on Information Technology (INCIT)","volume":"64 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 2nd International Conference on Information Technology (INCIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INCIT.2017.8257883","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10
Abstract
Health data has been drastically increasing in capacity and variety. Due to large and complex collection of datasets, it is difficult to process data using traditional data processing techniques. Machine Learning techniques, such as KNN and Naïve Bayes, have been used. Python and Weka are tools that are widely used in the field of data analytics. Therefore, this paper gives the comprehensive comparison between both tools together with some machine learning algorithms on data analytic of Dialysis Dataset. The results show that using Python provides the better performance in term of correct/incorrect instances, precision, and recall.