Abrar Yaqoob, Navneet Kumar Verma, Rabia Musheer Aziz
{"title":"Improving breast cancer classification with mRMR + SS0 + WSVM: a hybrid approach","authors":"Abrar Yaqoob, Navneet Kumar Verma, Rabia Musheer Aziz","doi":"10.1007/s11042-024-20146-6","DOIUrl":null,"url":null,"abstract":"<p>Detecting breast cancer through histopathological images is time-consuming due to their volume and complexity. Speeding up early detection is crucial for timely medical intervention. Accurately classifying microarray data faces challenges from its dimensionality and noise. Researchers use gene selection techniques to address this issue. Additional techniques like pre-processing, ensemble, and normalization procedures aim to improve image quality. These can also impact classification approaches, helping resolve overfitting and data balance issues. A more sophisticated version could potentially boost classification accuracy while reducing overfitting. Recent technological advances have driven automated breast cancer diagnosis. This research introduces a novel method using Salp Swarm Optimization (SSO) and Support Vector Machines (SVMs) for gene selection and breast tumor classification. The process involves two stages: mRMR preselects genes based on their relevance and distinctiveness, followed by SSO-integrated WSVM for classification. WSVM, aided by SSO, trims redundant genes and assigns weights, enhancing gene significance. SSO also fine-tunes kernel parameters based on gene weights. Experimental results showcase the effectiveness of the mRMR-SSO-WSVM method, achieving high accuracy, precision, recall, and F1-score on breast gene expression datasets. Specifically, our approach achieved an accuracy of 99.62%, precision of 100%, recall of 100%, and an F1-score of 99.10%. Comparative analysis with existing methods demonstrates the superiority of our approach, with a 4% improvement in accuracy and a 3.5% increase in F1-score over traditional SVM-based methods. In conclusion, this study demonstrates the potential of the proposed mRMR-SSO-WSVM methodology in advancing breast cancer classification, offering significant improvements in performance metrics and effectively addressing challenges such as overfitting and data imbalance.</p>","PeriodicalId":18770,"journal":{"name":"Multimedia Tools and Applications","volume":null,"pages":null},"PeriodicalIF":3.0000,"publicationDate":"2024-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Multimedia Tools and Applications","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s11042-024-20146-6","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Detecting breast cancer through histopathological images is time-consuming due to their volume and complexity. Speeding up early detection is crucial for timely medical intervention. Accurately classifying microarray data faces challenges from its dimensionality and noise. Researchers use gene selection techniques to address this issue. Additional techniques like pre-processing, ensemble, and normalization procedures aim to improve image quality. These can also impact classification approaches, helping resolve overfitting and data balance issues. A more sophisticated version could potentially boost classification accuracy while reducing overfitting. Recent technological advances have driven automated breast cancer diagnosis. This research introduces a novel method using Salp Swarm Optimization (SSO) and Support Vector Machines (SVMs) for gene selection and breast tumor classification. The process involves two stages: mRMR preselects genes based on their relevance and distinctiveness, followed by SSO-integrated WSVM for classification. WSVM, aided by SSO, trims redundant genes and assigns weights, enhancing gene significance. SSO also fine-tunes kernel parameters based on gene weights. Experimental results showcase the effectiveness of the mRMR-SSO-WSVM method, achieving high accuracy, precision, recall, and F1-score on breast gene expression datasets. Specifically, our approach achieved an accuracy of 99.62%, precision of 100%, recall of 100%, and an F1-score of 99.10%. Comparative analysis with existing methods demonstrates the superiority of our approach, with a 4% improvement in accuracy and a 3.5% increase in F1-score over traditional SVM-based methods. In conclusion, this study demonstrates the potential of the proposed mRMR-SSO-WSVM methodology in advancing breast cancer classification, offering significant improvements in performance metrics and effectively addressing challenges such as overfitting and data imbalance.
期刊介绍:
Multimedia Tools and Applications publishes original research articles on multimedia development and system support tools as well as case studies of multimedia applications. It also features experimental and survey articles. The journal is intended for academics, practitioners, scientists and engineers who are involved in multimedia system research, design and applications. All papers are peer reviewed.
Specific areas of interest include:
- Multimedia Tools:
- Multimedia Applications:
- Prototype multimedia systems and platforms