Breast Cancer Risk Prediction based on Six Machine Learning Algorithms

M. Razu Ahmed, Md. Asraf Ali, Joy Roy, Shakil Ahmed, N. Ahmed
{"title":"Breast Cancer Risk Prediction based on Six Machine Learning Algorithms","authors":"M. Razu Ahmed, Md. Asraf Ali, Joy Roy, Shakil Ahmed, N. Ahmed","doi":"10.1109/CSDE50874.2020.9411572","DOIUrl":null,"url":null,"abstract":"Breast Cancer is the second most important cause of death among women. As per the clinical expert, breast cancer is one of prominent cancers after lung cancer. However, early detection of this type of cancer in its initial stage helps to save lifes and increases lifespan. The survival chance of a patient can increase if there is a classifier that helps with a quick prediction of breast cancer. Therefore, a smart framework is required that can effectively detect and predict with high accuracy early stage of breast cancer. In this article, six machine learning classification algorithms, namely Logistic Regression (LR), K-Nearest Neighbours (kNN), Decision Tree (DT), Support Vector Machine (SVM), Naive Bayes (NB), and Random Forest (RF) are implemented in order to evaluate the performance and the prediction power of the model. The main target of this work is to compare these algorithm performances using the Wisconsin Breast Cancer (original) dataset. The number of performance metrics such as accuracy, precision, recall, f-1 score, and specificity are taken into consideration Our analysis of the results shows that the Support Vector Machine achieved the highest accuracy of 97.07% with the least error rate and Naive Bayes gives the lowest accuracy of 96%. All these experiments were carried out using SciKit.","PeriodicalId":445708,"journal":{"name":"2020 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSDE50874.2020.9411572","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

Breast Cancer is the second most important cause of death among women. As per the clinical expert, breast cancer is one of prominent cancers after lung cancer. However, early detection of this type of cancer in its initial stage helps to save lifes and increases lifespan. The survival chance of a patient can increase if there is a classifier that helps with a quick prediction of breast cancer. Therefore, a smart framework is required that can effectively detect and predict with high accuracy early stage of breast cancer. In this article, six machine learning classification algorithms, namely Logistic Regression (LR), K-Nearest Neighbours (kNN), Decision Tree (DT), Support Vector Machine (SVM), Naive Bayes (NB), and Random Forest (RF) are implemented in order to evaluate the performance and the prediction power of the model. The main target of this work is to compare these algorithm performances using the Wisconsin Breast Cancer (original) dataset. The number of performance metrics such as accuracy, precision, recall, f-1 score, and specificity are taken into consideration Our analysis of the results shows that the Support Vector Machine achieved the highest accuracy of 97.07% with the least error rate and Naive Bayes gives the lowest accuracy of 96%. All these experiments were carried out using SciKit.
基于六种机器学习算法的乳腺癌风险预测
乳腺癌是妇女死亡的第二大原因。据临床专家介绍,乳腺癌是仅次于肺癌的主要癌症之一。然而,在早期发现这种类型的癌症有助于挽救生命,延长寿命。如果有一个分类器可以帮助快速预测乳腺癌,患者的生存机会就会增加。因此,需要一种能够有效、高精度地检测和预测早期乳腺癌的智能框架。本文采用逻辑回归(LR)、k近邻(kNN)、决策树(DT)、支持向量机(SVM)、朴素贝叶斯(NB)和随机森林(RF)六种机器学习分类算法来评估模型的性能和预测能力。这项工作的主要目标是使用威斯康星州乳腺癌(原始)数据集比较这些算法的性能。分析结果表明,支持向量机的准确率最高,达到97.07%,错误率最低,朴素贝叶斯的准确率最低,为96%。所有实验均使用SciKit进行。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信