{"title":"Early Detection of Breast Cancer Tumors using Linear Discriminant Analysis Feature Selection with Different Machine Learning Classification Methods","authors":"M. Abbas, Hamid Ghous","doi":"10.5121/cseij.2022.12117","DOIUrl":null,"url":null,"abstract":"Globally, the frequency of breast cancer and its morality speak to a critical and developing risk for the developing countries. In Asia, Pakistan has the biggest rate of breast cancer. It is evaluated that every year 83,000 cases were reported in Pakistan and over 40,000 deaths are caused by breast cancer. Patients suffering from this malignancy have a better chance of surviving if they are diagnosed early. Many Early identification of breast cancer can be achieved using data mining techniques, allowing preventative treatments to be done. In this research Wisconsin Breast Cancer Dataset (WBCD) and Duke Breast cancer dataset (DBDS) are used with Linear Discriminant Analysis (LDA) feature selection with Support Vector Machine (SVM), Decision Tree (DT), Neural Network and Random Forest (RF) machine learning classifiers to predict breast cancer tumors. The finding of the proposed model is that feature selections through LDA improve the accuracy of detecting tumors and also reduce time duration of executing model. The best machine learning model with LDA feature selection is Neural Network Model with highest accuracy 1.00 among all classification models and also consume less time.","PeriodicalId":361871,"journal":{"name":"Computer Science & Engineering: An International Journal","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Science & Engineering: An International Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5121/cseij.2022.12117","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
Globally, the frequency of breast cancer and its morality speak to a critical and developing risk for the developing countries. In Asia, Pakistan has the biggest rate of breast cancer. It is evaluated that every year 83,000 cases were reported in Pakistan and over 40,000 deaths are caused by breast cancer. Patients suffering from this malignancy have a better chance of surviving if they are diagnosed early. Many Early identification of breast cancer can be achieved using data mining techniques, allowing preventative treatments to be done. In this research Wisconsin Breast Cancer Dataset (WBCD) and Duke Breast cancer dataset (DBDS) are used with Linear Discriminant Analysis (LDA) feature selection with Support Vector Machine (SVM), Decision Tree (DT), Neural Network and Random Forest (RF) machine learning classifiers to predict breast cancer tumors. The finding of the proposed model is that feature selections through LDA improve the accuracy of detecting tumors and also reduce time duration of executing model. The best machine learning model with LDA feature selection is Neural Network Model with highest accuracy 1.00 among all classification models and also consume less time.