{"title":"Enhancing Association Rules using Generative Adversarial Networks for Breast Cancer Classification","authors":"Menatalla Haggag, Lubana Al Rayes, Z. Aghbari","doi":"10.1109/ICETSIS61505.2024.10459572","DOIUrl":null,"url":null,"abstract":"The core algorithms of data mining (DM) enable the discovery of new information and insights by analyzing large amounts of data. Association rules mining (ARM), one of the several DM approaches, is extremely important in DM research. By utilizing ARM in medical diagnosis, early disease detection can be enhanced, and treatment recommendations can be improved based on data-driven insights. Breast cancer remains the leading cause of cancer-related deaths among women on a global scale. It is a huge challenge to researchers in the medical field concerning its diagnosis and prognosis. This paper aims to leverage ARM for the generation of associations that contribute to either recurrence or no-recurrence events in breast cancer. The study utilizes the Breast Cancer dataset from the UCI repository. To ensure comprehensive coverage of associations in both classes, the dataset is balanced using Synthetic Minority Over-sampling Technique (SMOTE) and Generative Adversarial Networks (GAN). Utilizing GAN to balance the dataset enhanced the performance of the association classification.","PeriodicalId":518932,"journal":{"name":"2024 ASU International Conference in Emerging Technologies for Sustainability and Intelligent Systems (ICETSIS)","volume":"40 1","pages":"634-638"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2024 ASU International Conference in Emerging Technologies for Sustainability and Intelligent Systems (ICETSIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICETSIS61505.2024.10459572","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The core algorithms of data mining (DM) enable the discovery of new information and insights by analyzing large amounts of data. Association rules mining (ARM), one of the several DM approaches, is extremely important in DM research. By utilizing ARM in medical diagnosis, early disease detection can be enhanced, and treatment recommendations can be improved based on data-driven insights. Breast cancer remains the leading cause of cancer-related deaths among women on a global scale. It is a huge challenge to researchers in the medical field concerning its diagnosis and prognosis. This paper aims to leverage ARM for the generation of associations that contribute to either recurrence or no-recurrence events in breast cancer. The study utilizes the Breast Cancer dataset from the UCI repository. To ensure comprehensive coverage of associations in both classes, the dataset is balanced using Synthetic Minority Over-sampling Technique (SMOTE) and Generative Adversarial Networks (GAN). Utilizing GAN to balance the dataset enhanced the performance of the association classification.
数据挖掘(DM)的核心算法能够通过分析大量数据发现新信息和新见解。关联规则挖掘(ARM)是几种数据挖掘方法之一,在数据挖掘研究中极为重要。在医疗诊断中利用关联规则挖掘,可以提高疾病的早期发现率,并根据数据驱动的洞察力改进治疗建议。乳腺癌仍然是全球妇女因癌症死亡的主要原因。对于医学领域的研究人员来说,乳腺癌的诊断和预后是一个巨大的挑战。本文旨在利用 ARM 生成有助于乳腺癌复发或不再复发的关联。该研究利用了 UCI 数据库中的乳腺癌数据集。为确保两类关联的全面覆盖,数据集使用合成少数群体过度采样技术(SMOTE)和生成对抗网络(GAN)进行平衡。利用 GAN 平衡数据集提高了关联分类的性能。