{"title":"Comparative study on data mining classification methods for cervical cancer prediction using pap smear results","authors":"Y. Kurniawati, A. E. Permanasari, S. Fauziati","doi":"10.1109/IBIOMED.2016.7869827","DOIUrl":null,"url":null,"abstract":"The number of woman with cervical cancer in Indonesia is getting higher. Indonesia becomes the country with the highest number of women with cervical cancer in the world. Cervical cancer became the highest cause of cancer deaths in women globally. There has been a lot of research using data mining techniques with variety of different data mining models that can be used for analyzing cervical cancer. In this research, data that be used were obtained from the medical records of the Pap smear test results. There are 38 symptoms and 7 classes. Naïve Bayes, Support Vector Machines (SVM), and Random Forest Tree was used to evaluate the performance of the classifier. The performance matric that used in this study are accuracy, recall, precision, and ROC curve. Based on the performance matric, Random Forest Tree is the best classifier among other classifiers to classify Pap smear results.","PeriodicalId":171132,"journal":{"name":"2016 1st International Conference on Biomedical Engineering (IBIOMED)","volume":"55 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"22","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 1st International Conference on Biomedical Engineering (IBIOMED)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IBIOMED.2016.7869827","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 22
Abstract
The number of woman with cervical cancer in Indonesia is getting higher. Indonesia becomes the country with the highest number of women with cervical cancer in the world. Cervical cancer became the highest cause of cancer deaths in women globally. There has been a lot of research using data mining techniques with variety of different data mining models that can be used for analyzing cervical cancer. In this research, data that be used were obtained from the medical records of the Pap smear test results. There are 38 symptoms and 7 classes. Naïve Bayes, Support Vector Machines (SVM), and Random Forest Tree was used to evaluate the performance of the classifier. The performance matric that used in this study are accuracy, recall, precision, and ROC curve. Based on the performance matric, Random Forest Tree is the best classifier among other classifiers to classify Pap smear results.