使用增强分类器预测乳腺癌的机器学习方法

Q4 Engineering
Md. Mijanur Rahman, Zannatul Ferdousi, Puja Saha, R. Mayuri
{"title":"使用增强分类器预测乳腺癌的机器学习方法","authors":"Md. Mijanur Rahman, Zannatul Ferdousi, Puja Saha, R. Mayuri","doi":"10.21817/indjcse/2023/v14i3/231403009","DOIUrl":null,"url":null,"abstract":"Breast cancer is a prevalent disease, with the second highest incidence rate among all types of cancer. The risk of death from breast cancer is increasing due to rapid population growth, and a dependable and quick diagnostic system can assist medical professionals in disease diagnosis and lower the mortality rate. In this study, various machine-learning algorithms are examined for predicting the stages of breast cancer, and most especially in the medical field, where those methods are widely used in diagnosis and analysis for decision-making. We focused on boosting classification models and evaluated the performance of XGBoost, AdaBoost, and Gradient Boosting. Our goal is to achieve higher accuracy by using boosting classifiers with hyperparameter tuning for the prediction of breast cancer stages, precisely the distinction between \"Benign\" and \"Malignant\" types of breast cancer. The Wisconsin breast cancer dataset is employed from the UCI machine learning database. The performance of our model was evaluated using metrics such as accuracy, sensitivity, precision, specificity, AUC, and ROC curves for various strategies. After implementing the model, this study achieved the best model accuracy, and 98.60% was achieved on AdaBoost.","PeriodicalId":52250,"journal":{"name":"Indian Journal of Computer Science and Engineering","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Machine Learning Approach to Predict Breast Cancer Using Boosting Classifiers\",\"authors\":\"Md. Mijanur Rahman, Zannatul Ferdousi, Puja Saha, R. Mayuri\",\"doi\":\"10.21817/indjcse/2023/v14i3/231403009\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Breast cancer is a prevalent disease, with the second highest incidence rate among all types of cancer. The risk of death from breast cancer is increasing due to rapid population growth, and a dependable and quick diagnostic system can assist medical professionals in disease diagnosis and lower the mortality rate. In this study, various machine-learning algorithms are examined for predicting the stages of breast cancer, and most especially in the medical field, where those methods are widely used in diagnosis and analysis for decision-making. We focused on boosting classification models and evaluated the performance of XGBoost, AdaBoost, and Gradient Boosting. Our goal is to achieve higher accuracy by using boosting classifiers with hyperparameter tuning for the prediction of breast cancer stages, precisely the distinction between \\\"Benign\\\" and \\\"Malignant\\\" types of breast cancer. The Wisconsin breast cancer dataset is employed from the UCI machine learning database. The performance of our model was evaluated using metrics such as accuracy, sensitivity, precision, specificity, AUC, and ROC curves for various strategies. After implementing the model, this study achieved the best model accuracy, and 98.60% was achieved on AdaBoost.\",\"PeriodicalId\":52250,\"journal\":{\"name\":\"Indian Journal of Computer Science and Engineering\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Indian Journal of Computer Science and Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.21817/indjcse/2023/v14i3/231403009\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"Engineering\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Indian Journal of Computer Science and Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21817/indjcse/2023/v14i3/231403009","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Engineering","Score":null,"Total":0}
引用次数: 0

摘要

乳腺癌是一种流行疾病,在所有类型的癌症中发病率第二高。由于人口的快速增长,乳腺癌的死亡风险正在增加,一个可靠、快速的诊断系统可以帮助医疗专业人员进行疾病诊断,降低死亡率。在这项研究中,研究了各种机器学习算法来预测乳腺癌的阶段,尤其是在医学领域,这些方法被广泛用于诊断和决策分析。我们专注于增强分类模型,并评估了XGBoost、AdaBoost和Gradient boosting的性能。我们的目标是通过使用带有超参数调整的增强分类器来预测乳腺癌的分期,精确地区分“良性”和“恶性”类型的乳腺癌,从而达到更高的准确性。威斯康星乳腺癌数据集来自UCI机器学习数据库。使用各种策略的准确度、灵敏度、精密度、特异性、AUC和ROC曲线等指标来评估我们模型的性能。模型实现后,本研究达到了最好的模型准确率,在AdaBoost上达到了98.60%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Machine Learning Approach to Predict Breast Cancer Using Boosting Classifiers
Breast cancer is a prevalent disease, with the second highest incidence rate among all types of cancer. The risk of death from breast cancer is increasing due to rapid population growth, and a dependable and quick diagnostic system can assist medical professionals in disease diagnosis and lower the mortality rate. In this study, various machine-learning algorithms are examined for predicting the stages of breast cancer, and most especially in the medical field, where those methods are widely used in diagnosis and analysis for decision-making. We focused on boosting classification models and evaluated the performance of XGBoost, AdaBoost, and Gradient Boosting. Our goal is to achieve higher accuracy by using boosting classifiers with hyperparameter tuning for the prediction of breast cancer stages, precisely the distinction between "Benign" and "Malignant" types of breast cancer. The Wisconsin breast cancer dataset is employed from the UCI machine learning database. The performance of our model was evaluated using metrics such as accuracy, sensitivity, precision, specificity, AUC, and ROC curves for various strategies. After implementing the model, this study achieved the best model accuracy, and 98.60% was achieved on AdaBoost.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Indian Journal of Computer Science and Engineering
Indian Journal of Computer Science and Engineering Engineering-Engineering (miscellaneous)
自引率
0.00%
发文量
146
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信