基于无监督学习技术的K-Means聚类算法的乳腺癌预测

Soumyalatha Naveen, Nachiketh V. Kashyap, Varun P Kulkarni, Sandeep A, M. S. Chakradhar
{"title":"基于无监督学习技术的K-Means聚类算法的乳腺癌预测","authors":"Soumyalatha Naveen, Nachiketh V. Kashyap, Varun P Kulkarni, Sandeep A, M. S. Chakradhar","doi":"10.1109/ViTECoN58111.2023.10157765","DOIUrl":null,"url":null,"abstract":"Breast cancer is a highly deadly and common cancer that is spreading over the world and claiming many lives. Breast cancer develops in the glandular tissue of the breast in the lining of epithelial cells of ducts or (15%) lobules. These tissues heal with time. These in situ tumours may develop over time and infect the breast cells' immediate environs (stage 1), impact the lymph nodes, and finally totally spread throughout the body's organs (stage 3). There were 685,000 deaths worldwide in 2020 and a later rise in the number of malignant women. Cancer was a common disease that affected 7.8 million of the female population. The k-means algorithm is a popular data clustering algorithm. As the main analytical routine in data mining, the techniques of the clustering algorithm will impact the clustering outcome directly. This paper examines the shortcomings of the standard k-means algorithm and discusses them. This paper reviews existing methods for selecting the number of clusters for the algorithm. However, one of its drawbacks is the requirement that the number of clusters, K, be specified before the algorithm is applied. Therefore, we obtained an accuracy of 85% after executing the program, and we were able to depict the difference between a benign and a malignant tumour. And we were able to see the centroid between the two tumours.","PeriodicalId":407488,"journal":{"name":"2023 2nd International Conference on Vision Towards Emerging Trends in Communication and Networking Technologies (ViTECoN)","volume":"322 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Breast Cancer Prediction Using Unsupervised Learning Technique K-Means Clustering Algorithm\",\"authors\":\"Soumyalatha Naveen, Nachiketh V. Kashyap, Varun P Kulkarni, Sandeep A, M. S. Chakradhar\",\"doi\":\"10.1109/ViTECoN58111.2023.10157765\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Breast cancer is a highly deadly and common cancer that is spreading over the world and claiming many lives. Breast cancer develops in the glandular tissue of the breast in the lining of epithelial cells of ducts or (15%) lobules. These tissues heal with time. These in situ tumours may develop over time and infect the breast cells' immediate environs (stage 1), impact the lymph nodes, and finally totally spread throughout the body's organs (stage 3). There were 685,000 deaths worldwide in 2020 and a later rise in the number of malignant women. Cancer was a common disease that affected 7.8 million of the female population. The k-means algorithm is a popular data clustering algorithm. As the main analytical routine in data mining, the techniques of the clustering algorithm will impact the clustering outcome directly. This paper examines the shortcomings of the standard k-means algorithm and discusses them. This paper reviews existing methods for selecting the number of clusters for the algorithm. However, one of its drawbacks is the requirement that the number of clusters, K, be specified before the algorithm is applied. Therefore, we obtained an accuracy of 85% after executing the program, and we were able to depict the difference between a benign and a malignant tumour. And we were able to see the centroid between the two tumours.\",\"PeriodicalId\":407488,\"journal\":{\"name\":\"2023 2nd International Conference on Vision Towards Emerging Trends in Communication and Networking Technologies (ViTECoN)\",\"volume\":\"322 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 2nd International Conference on Vision Towards Emerging Trends in Communication and Networking Technologies (ViTECoN)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ViTECoN58111.2023.10157765\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 2nd International Conference on Vision Towards Emerging Trends in Communication and Networking Technologies (ViTECoN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ViTECoN58111.2023.10157765","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

乳腺癌是一种高度致命的常见癌症,正在世界范围内蔓延,夺走了许多人的生命。乳腺癌发生在乳腺腺组织中导管上皮细胞的内壁或小叶中(15%)。这些组织会随着时间愈合。随着时间的推移,这些原位肿瘤可能会发展并感染乳腺细胞的周围环境(第一阶段),影响淋巴结,最终完全扩散到全身器官(第三阶段)。2020年,全球有68.5万人死亡,后来恶性女性的数量上升。癌症是一种常见病,影响了780万女性人口。k-means算法是一种流行的数据聚类算法。聚类算法作为数据挖掘中的主要分析程序,其技术优劣直接影响聚类结果。本文分析了标准k-均值算法的缺点,并对其进行了讨论。本文回顾了现有的选择算法簇数的方法。然而,它的缺点之一是要求在应用算法之前指定簇的数量K。因此,在执行程序后,我们获得了85%的准确率,并且我们能够描述良性和恶性肿瘤之间的区别。我们可以看到两个肿瘤之间的质心。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Breast Cancer Prediction Using Unsupervised Learning Technique K-Means Clustering Algorithm
Breast cancer is a highly deadly and common cancer that is spreading over the world and claiming many lives. Breast cancer develops in the glandular tissue of the breast in the lining of epithelial cells of ducts or (15%) lobules. These tissues heal with time. These in situ tumours may develop over time and infect the breast cells' immediate environs (stage 1), impact the lymph nodes, and finally totally spread throughout the body's organs (stage 3). There were 685,000 deaths worldwide in 2020 and a later rise in the number of malignant women. Cancer was a common disease that affected 7.8 million of the female population. The k-means algorithm is a popular data clustering algorithm. As the main analytical routine in data mining, the techniques of the clustering algorithm will impact the clustering outcome directly. This paper examines the shortcomings of the standard k-means algorithm and discusses them. This paper reviews existing methods for selecting the number of clusters for the algorithm. However, one of its drawbacks is the requirement that the number of clusters, K, be specified before the algorithm is applied. Therefore, we obtained an accuracy of 85% after executing the program, and we were able to depict the difference between a benign and a malignant tumour. And we were able to see the centroid between the two tumours.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信