Utilizing the K-Means Algorithm for Breast Cancer Diagnosis: A Promising Approach for Improved Early Detection

Matics: Jurnal Teknik Informatika Pub Date : 2023-10-23 DOI:10.18860/mat.v15i2.23644

Nur Fitriyah Ayu Tunjung Sari, Maharini Nabela, Muhammad Falah Abdurrohman

{"title":"Utilizing the K-Means Algorithm for Breast Cancer Diagnosis: A Promising Approach for Improved Early Detection","authors":"Nur Fitriyah Ayu Tunjung Sari, Maharini Nabela, Muhammad Falah Abdurrohman","doi":"10.18860/mat.v15i2.23644","DOIUrl":null,"url":null,"abstract":"Breast cancer is a pressing non-communicable disease, especially affecting women, with its incidence on the rise. In 2020, it ranked among the most common cancers in Indonesia. Timely detection and precise diagnosis are pivotal for effective breast cancer management. To enhance diagnostic accuracy, the K-means clustering method is applied to group patients based on shared attributes. This research aims to contribute significantly to breast cancer diagnosis by leveraging the K-means method, potentially improving patient survival rates.The research process involves data collection, preprocessing, K-means application, evaluation, and visualization. A dataset of 569 breast cancer patient records with 32 attributes from Kaggle is utilized. The K-Means algorithm is assessed using accuracy, yielding a value of 0.8457, signifying good performance. Malignant cases (211) and benign cases (301) are visualized in a scatter plot, distinguishing between them.In conclusion, this study presents an initial step in utilizing the K-means algorithm for breast cancer diagnosis, offering promising results. Further research and the development of more advanced models are imperative to address the global health challenge posed by breast cancer among women.Index Terms—breast cancer; clustering; K-Means Algorithm","PeriodicalId":497787,"journal":{"name":"Matics: Jurnal Teknik Informatika","volume":"29 6","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Matics: Jurnal Teknik Informatika","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18860/mat.v15i2.23644","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Breast cancer is a pressing non-communicable disease, especially affecting women, with its incidence on the rise. In 2020, it ranked among the most common cancers in Indonesia. Timely detection and precise diagnosis are pivotal for effective breast cancer management. To enhance diagnostic accuracy, the K-means clustering method is applied to group patients based on shared attributes. This research aims to contribute significantly to breast cancer diagnosis by leveraging the K-means method, potentially improving patient survival rates.The research process involves data collection, preprocessing, K-means application, evaluation, and visualization. A dataset of 569 breast cancer patient records with 32 attributes from Kaggle is utilized. The K-Means algorithm is assessed using accuracy, yielding a value of 0.8457, signifying good performance. Malignant cases (211) and benign cases (301) are visualized in a scatter plot, distinguishing between them.In conclusion, this study presents an initial step in utilizing the K-means algorithm for breast cancer diagnosis, offering promising results. Further research and the development of more advanced models are imperative to address the global health challenge posed by breast cancer among women.Index Terms—breast cancer; clustering; K-Means Algorithm

查看原文本刊更多论文

利用k -均值算法进行乳腺癌诊断:一种改善早期检测的有希望的方法

乳腺癌是一种紧迫的非传染性疾病，对妇女的影响尤其大，其发病率呈上升趋势。到2020年，它已成为印尼最常见的癌症之一。及时发现和准确诊断是有效治疗乳腺癌的关键。为了提高诊断的准确性，采用K-means聚类方法基于共享属性对患者进行分组。本研究旨在通过利用K-means方法为乳腺癌诊断做出重大贡献，潜在地提高患者的生存率。研究过程包括数据收集、预处理、K-means应用、评估和可视化。使用了Kaggle的569个乳腺癌患者记录的数据集，其中包含32个属性。K-Means算法使用精度进行评估，产生的值为0.8457，表明性能良好。恶性病例(211例)和良性病例(301例)在散点图中可视化，以区分它们。总之，本研究在利用K-means算法进行乳腺癌诊断方面迈出了第一步，并提供了有希望的结果。必须进一步研究和开发更先进的模型，以应对妇女乳腺癌对全球健康构成的挑战。索引术语:乳腺癌;聚类;k - means算法

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Matics: Jurnal Teknik Informatika

自引率

0.00%

发文量