Early Detection of Breast Cancer Tumors using Linear Discriminant Analysis Feature Selection with Different Machine Learning Classification Methods

M. Abbas, Hamid Ghous
{"title":"Early Detection of Breast Cancer Tumors using Linear Discriminant Analysis Feature Selection with Different Machine Learning Classification Methods","authors":"M. Abbas, Hamid Ghous","doi":"10.5121/cseij.2022.12117","DOIUrl":null,"url":null,"abstract":"Globally, the frequency of breast cancer and its morality speak to a critical and developing risk for the developing countries. In Asia, Pakistan has the biggest rate of breast cancer. It is evaluated that every year 83,000 cases were reported in Pakistan and over 40,000 deaths are caused by breast cancer. Patients suffering from this malignancy have a better chance of surviving if they are diagnosed early. Many Early identification of breast cancer can be achieved using data mining techniques, allowing preventative treatments to be done. In this research Wisconsin Breast Cancer Dataset (WBCD) and Duke Breast cancer dataset (DBDS) are used with Linear Discriminant Analysis (LDA) feature selection with Support Vector Machine (SVM), Decision Tree (DT), Neural Network and Random Forest (RF) machine learning classifiers to predict breast cancer tumors. The finding of the proposed model is that feature selections through LDA improve the accuracy of detecting tumors and also reduce time duration of executing model. The best machine learning model with LDA feature selection is Neural Network Model with highest accuracy 1.00 among all classification models and also consume less time.","PeriodicalId":361871,"journal":{"name":"Computer Science & Engineering: An International Journal","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Science & Engineering: An International Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5121/cseij.2022.12117","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

Globally, the frequency of breast cancer and its morality speak to a critical and developing risk for the developing countries. In Asia, Pakistan has the biggest rate of breast cancer. It is evaluated that every year 83,000 cases were reported in Pakistan and over 40,000 deaths are caused by breast cancer. Patients suffering from this malignancy have a better chance of surviving if they are diagnosed early. Many Early identification of breast cancer can be achieved using data mining techniques, allowing preventative treatments to be done. In this research Wisconsin Breast Cancer Dataset (WBCD) and Duke Breast cancer dataset (DBDS) are used with Linear Discriminant Analysis (LDA) feature selection with Support Vector Machine (SVM), Decision Tree (DT), Neural Network and Random Forest (RF) machine learning classifiers to predict breast cancer tumors. The finding of the proposed model is that feature selections through LDA improve the accuracy of detecting tumors and also reduce time duration of executing model. The best machine learning model with LDA feature selection is Neural Network Model with highest accuracy 1.00 among all classification models and also consume less time.
基于线性判别分析的乳腺癌肿瘤早期检测与不同机器学习分类方法的特征选择
在全球范围内,乳腺癌的发病率及其道德性表明,发展中国家面临着一种严重的、正在发展的风险。在亚洲,巴基斯坦的乳腺癌发病率最高。据评估,巴基斯坦每年报告的乳腺癌病例为83 000例,死亡人数超过40 000人。患有这种恶性肿瘤的患者如果得到早期诊断,生存的机会会更大。使用数据挖掘技术可以实现许多乳腺癌的早期识别,从而可以进行预防性治疗。本研究将威斯康星乳腺癌数据集(WBCD)和杜克乳腺癌数据集(DBDS)与线性判别分析(LDA)特征选择结合支持向量机(SVM)、决策树(DT)、神经网络和随机森林(RF)机器学习分类器进行乳腺癌肿瘤预测。该模型的发现是,通过LDA进行特征选择提高了肿瘤检测的准确性,同时也缩短了模型的执行时间。LDA特征选择的最佳机器学习模型是Neural Network model,在所有分类模型中准确率最高,且耗时更少。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信