Afrah Rashid, Syeda Sohana Binta Farhad, Afsana Bhuyian, N. Yeasmin, Mohammad Abdul Azim, Z. Alom
{"title":"使用WEKA进行乳腺癌诊断的机器学习技术比较分析","authors":"Afrah Rashid, Syeda Sohana Binta Farhad, Afsana Bhuyian, N. Yeasmin, Mohammad Abdul Azim, Z. Alom","doi":"10.1109/ICCIT57492.2022.10055421","DOIUrl":null,"url":null,"abstract":"Breast cancer is one of the most common malignancies affecting women worldwide, with many fatalities yearly. The risk of death suffered by breast cancer is increasing exponentially. Due to a surge of development of research in the medical field, providing more timely and possible early detection of disease has become a time-demanding option. By far, radiologists have manually checked cancer images and diagnosed them. Research has shown that a considerable number of ultrasound images are created every individual day. However, the number of radiologists is limited, so they cannot provide service on time. However, they often misclassify breast lesions, resulting in a high false-positive rate. An automatic system for detecting disease assists radiologists in disease diagnosis and provides reliable, productive, and reduces the risk of death. In this paper, we compare six machine learning models, namely (i) Support Vector Machine (SVM), (ii) Naive Bayes (NB), (iii) Logistic Regression (LR), (iv) Decision Tree (DT), (v) Random Forest (RF), and (vi) k-Nearest Neighbors (k-NN) on two different datasets (i) the Wisconsin Breast Cancer Dataset (WBCD) and (ii) the Breast Cancer Coimbra Dataset (BCCD). This study aims to create different classification models to analyze the obtained results and compare them to predict breast cancer. We use several performance metrics to select the best classification model among them. Our comparative analysis shows that SVM models can achieve better performance metrics, and thus the model of this research possesses relevant to use in clinical applications.","PeriodicalId":255498,"journal":{"name":"2022 25th International Conference on Computer and Information Technology (ICCIT)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A Comparative Analysis of Machine Learning techniques on Breast Cancer diagnosis using WEKA\",\"authors\":\"Afrah Rashid, Syeda Sohana Binta Farhad, Afsana Bhuyian, N. Yeasmin, Mohammad Abdul Azim, Z. Alom\",\"doi\":\"10.1109/ICCIT57492.2022.10055421\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Breast cancer is one of the most common malignancies affecting women worldwide, with many fatalities yearly. The risk of death suffered by breast cancer is increasing exponentially. Due to a surge of development of research in the medical field, providing more timely and possible early detection of disease has become a time-demanding option. By far, radiologists have manually checked cancer images and diagnosed them. Research has shown that a considerable number of ultrasound images are created every individual day. However, the number of radiologists is limited, so they cannot provide service on time. However, they often misclassify breast lesions, resulting in a high false-positive rate. An automatic system for detecting disease assists radiologists in disease diagnosis and provides reliable, productive, and reduces the risk of death. In this paper, we compare six machine learning models, namely (i) Support Vector Machine (SVM), (ii) Naive Bayes (NB), (iii) Logistic Regression (LR), (iv) Decision Tree (DT), (v) Random Forest (RF), and (vi) k-Nearest Neighbors (k-NN) on two different datasets (i) the Wisconsin Breast Cancer Dataset (WBCD) and (ii) the Breast Cancer Coimbra Dataset (BCCD). This study aims to create different classification models to analyze the obtained results and compare them to predict breast cancer. We use several performance metrics to select the best classification model among them. Our comparative analysis shows that SVM models can achieve better performance metrics, and thus the model of this research possesses relevant to use in clinical applications.\",\"PeriodicalId\":255498,\"journal\":{\"name\":\"2022 25th International Conference on Computer and Information Technology (ICCIT)\",\"volume\":\"3 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 25th International Conference on Computer and Information Technology (ICCIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCIT57492.2022.10055421\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 25th International Conference on Computer and Information Technology (ICCIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCIT57492.2022.10055421","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Comparative Analysis of Machine Learning techniques on Breast Cancer diagnosis using WEKA
Breast cancer is one of the most common malignancies affecting women worldwide, with many fatalities yearly. The risk of death suffered by breast cancer is increasing exponentially. Due to a surge of development of research in the medical field, providing more timely and possible early detection of disease has become a time-demanding option. By far, radiologists have manually checked cancer images and diagnosed them. Research has shown that a considerable number of ultrasound images are created every individual day. However, the number of radiologists is limited, so they cannot provide service on time. However, they often misclassify breast lesions, resulting in a high false-positive rate. An automatic system for detecting disease assists radiologists in disease diagnosis and provides reliable, productive, and reduces the risk of death. In this paper, we compare six machine learning models, namely (i) Support Vector Machine (SVM), (ii) Naive Bayes (NB), (iii) Logistic Regression (LR), (iv) Decision Tree (DT), (v) Random Forest (RF), and (vi) k-Nearest Neighbors (k-NN) on two different datasets (i) the Wisconsin Breast Cancer Dataset (WBCD) and (ii) the Breast Cancer Coimbra Dataset (BCCD). This study aims to create different classification models to analyze the obtained results and compare them to predict breast cancer. We use several performance metrics to select the best classification model among them. Our comparative analysis shows that SVM models can achieve better performance metrics, and thus the model of this research possesses relevant to use in clinical applications.