基于机器学习的乳腺癌可视化与分类

P. S. Shekar Varma, Sushil Kumar, K. Sri Vasuki Reddy
{"title":"基于机器学习的乳腺癌可视化与分类","authors":"P. S. Shekar Varma, Sushil Kumar, K. Sri Vasuki Reddy","doi":"10.1109/ICITIIT51526.2021.9399603","DOIUrl":null,"url":null,"abstract":"In contemporary years, the categorization of breast cancer has become an engrossing subject in the department of healthcare informatics due to prodigious deaths of the women across the world caused by this cancer. With the upcoming heed and variety of approaches in image processing and machine learning (ML), there has been an endeavor to erect a pattern recognition model that is well-grounded to boost the diagnosis standard. Diverse research has been attempted on mastering the prediction of the possibility of breast cancer using predefined data mining algorithms. In this paper, a model is presented using the support vector machine (SVM) algorithm for the manual categorizing of the histology images of breast cancer samples into benign and malignant subclasses to anticipate the interpretation. Primarily all the data incorporating a set of 30 features relating to the cell nuclei shown in the digitalized images of fine needle aspirate (FNA) of a breast mass are considered. Ten existing values of features are added up for every nuclei sample then the mean, the standard deviation, the worst and largest of the mentioned attributes are measured proceeding to 30 features. The total features obtained are visualized and apprehended to gain insight for future diagnosis. The principal component analysis (PCA) dimensionality reduction strategy is implemented to successfully augment the valiance of the attributes resolving eigenvector problem. The ultimate outcome is conceptualized using the confusion matrix and the receiver operating characteristic curve (ROC). This SVM forged model proves to show 97% accuracy with the recommended dataset.","PeriodicalId":161452,"journal":{"name":"2021 International Conference on Innovative Trends in Information Technology (ICITIIT)","volume":"125 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Machine Learning Based Breast Cancer Visualization and Classification\",\"authors\":\"P. S. Shekar Varma, Sushil Kumar, K. Sri Vasuki Reddy\",\"doi\":\"10.1109/ICITIIT51526.2021.9399603\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In contemporary years, the categorization of breast cancer has become an engrossing subject in the department of healthcare informatics due to prodigious deaths of the women across the world caused by this cancer. With the upcoming heed and variety of approaches in image processing and machine learning (ML), there has been an endeavor to erect a pattern recognition model that is well-grounded to boost the diagnosis standard. Diverse research has been attempted on mastering the prediction of the possibility of breast cancer using predefined data mining algorithms. In this paper, a model is presented using the support vector machine (SVM) algorithm for the manual categorizing of the histology images of breast cancer samples into benign and malignant subclasses to anticipate the interpretation. Primarily all the data incorporating a set of 30 features relating to the cell nuclei shown in the digitalized images of fine needle aspirate (FNA) of a breast mass are considered. Ten existing values of features are added up for every nuclei sample then the mean, the standard deviation, the worst and largest of the mentioned attributes are measured proceeding to 30 features. The total features obtained are visualized and apprehended to gain insight for future diagnosis. The principal component analysis (PCA) dimensionality reduction strategy is implemented to successfully augment the valiance of the attributes resolving eigenvector problem. The ultimate outcome is conceptualized using the confusion matrix and the receiver operating characteristic curve (ROC). This SVM forged model proves to show 97% accuracy with the recommended dataset.\",\"PeriodicalId\":161452,\"journal\":{\"name\":\"2021 International Conference on Innovative Trends in Information Technology (ICITIIT)\",\"volume\":\"125 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-02-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Conference on Innovative Trends in Information Technology (ICITIIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICITIIT51526.2021.9399603\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Innovative Trends in Information Technology (ICITIIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICITIIT51526.2021.9399603","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

近年来,由于全世界妇女因乳腺癌死亡的人数惊人,乳腺癌的分类已成为医疗信息学领域一个引人关注的课题。随着图像处理和机器学习(ML)方法的不断发展和多样化,人们一直在努力建立一个有充分基础的模式识别模型,以提高诊断标准。利用预定义的数据挖掘算法来掌握乳腺癌可能性的预测已经进行了各种各样的研究。本文提出了一种基于支持向量机(SVM)算法的模型,用于将乳腺癌样本的组织学图像手工分类为良性和恶性亚类,并预测其解释。首先,所有的数据,包括一组30个特征有关的细胞核显示在乳腺肿块的细针抽吸(FNA)的数字化图像。将每个核样本的十个现有特征值相加,然后测量上述属性的平均值、标准差、最差值和最大值,直至30个特征。获得的总体特征被可视化和理解,以获得对未来诊断的洞察力。采用主成分分析降维策略,有效地提高了特征向量特征值的有效性。使用混淆矩阵和受试者工作特征曲线(ROC)对最终结果进行概念化。该SVM伪造模型在推荐的数据集上显示出97%的准确率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Machine Learning Based Breast Cancer Visualization and Classification
In contemporary years, the categorization of breast cancer has become an engrossing subject in the department of healthcare informatics due to prodigious deaths of the women across the world caused by this cancer. With the upcoming heed and variety of approaches in image processing and machine learning (ML), there has been an endeavor to erect a pattern recognition model that is well-grounded to boost the diagnosis standard. Diverse research has been attempted on mastering the prediction of the possibility of breast cancer using predefined data mining algorithms. In this paper, a model is presented using the support vector machine (SVM) algorithm for the manual categorizing of the histology images of breast cancer samples into benign and malignant subclasses to anticipate the interpretation. Primarily all the data incorporating a set of 30 features relating to the cell nuclei shown in the digitalized images of fine needle aspirate (FNA) of a breast mass are considered. Ten existing values of features are added up for every nuclei sample then the mean, the standard deviation, the worst and largest of the mentioned attributes are measured proceeding to 30 features. The total features obtained are visualized and apprehended to gain insight for future diagnosis. The principal component analysis (PCA) dimensionality reduction strategy is implemented to successfully augment the valiance of the attributes resolving eigenvector problem. The ultimate outcome is conceptualized using the confusion matrix and the receiver operating characteristic curve (ROC). This SVM forged model proves to show 97% accuracy with the recommended dataset.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信