Breast Mass Classification with Deep Transfer Feature Extractor Model and Random Forest Classifier

Aarti Bokade, Ankit Shah
{"title":"Breast Mass Classification with Deep Transfer Feature Extractor Model and Random Forest Classifier","authors":"Aarti Bokade, Ankit Shah","doi":"10.1109/RTEICT52294.2021.9573909","DOIUrl":null,"url":null,"abstract":"Breast Cancer is the most common type of cancer & leading cause of deaths in women worldwide. Early diagnosis of breast cancer and proper treatment play a vital role in death rate reduction. The success of Deep Convolutional Neural Networks (CNN) models in image classification tasks with state of art level accuracy has always attracted researchers to use them for disease diagnosis in the field of medical imaging. The proposed method uses CNN based fixed feature extraction technique, a type of deep transfer learning approach to perform binary classification of breast masses using Mammography Images. Mammography images are obtained from three publicly available datasets namely Mammographic Image Analysis Society (MIAS), Digital Database for Screening Mammography (DDSM) and Inbreast. The pretrained CNN models: VGG16, VGG19 & Resnet-50 performs feature extraction from the mammography images. The extracted features from CNN models are then classified into malignant & benign masses using Random Forest machine learning classifier. The models performances have been summarized with performance matrices (Sensitivity, Specificity, F1-score, and Accuracy), onfusion matrices and ROC (Receiver Operating Characteristics) curves. The combination of pretrained models, VGG16/VGG19/Resnet-50 & RF classifier gave model accuracies of: 0.81%, 0.80%,0.83% for the MIAS dataset, 0.994%, 0.986%, 0.996% for DDSM Dataset and 0.83%, 0.81%,0.87% for Inbreast dataset respectively. Automated classification of breast mass from the mammography images can be used by the doctors as a quick and efficient method for breast cancer screening.","PeriodicalId":191410,"journal":{"name":"2021 International Conference on Recent Trends on Electronics, Information, Communication & Technology (RTEICT)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Recent Trends on Electronics, Information, Communication & Technology (RTEICT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RTEICT52294.2021.9573909","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Breast Cancer is the most common type of cancer & leading cause of deaths in women worldwide. Early diagnosis of breast cancer and proper treatment play a vital role in death rate reduction. The success of Deep Convolutional Neural Networks (CNN) models in image classification tasks with state of art level accuracy has always attracted researchers to use them for disease diagnosis in the field of medical imaging. The proposed method uses CNN based fixed feature extraction technique, a type of deep transfer learning approach to perform binary classification of breast masses using Mammography Images. Mammography images are obtained from three publicly available datasets namely Mammographic Image Analysis Society (MIAS), Digital Database for Screening Mammography (DDSM) and Inbreast. The pretrained CNN models: VGG16, VGG19 & Resnet-50 performs feature extraction from the mammography images. The extracted features from CNN models are then classified into malignant & benign masses using Random Forest machine learning classifier. The models performances have been summarized with performance matrices (Sensitivity, Specificity, F1-score, and Accuracy), onfusion matrices and ROC (Receiver Operating Characteristics) curves. The combination of pretrained models, VGG16/VGG19/Resnet-50 & RF classifier gave model accuracies of: 0.81%, 0.80%,0.83% for the MIAS dataset, 0.994%, 0.986%, 0.996% for DDSM Dataset and 0.83%, 0.81%,0.87% for Inbreast dataset respectively. Automated classification of breast mass from the mammography images can be used by the doctors as a quick and efficient method for breast cancer screening.
基于深度转移特征提取模型和随机森林分类器的乳腺肿块分类
乳腺癌是最常见的癌症类型,也是全世界妇女死亡的主要原因。乳腺癌的早期诊断和适当治疗对降低死亡率起着至关重要的作用。深度卷积神经网络(CNN)模型在图像分类任务中取得了成功,其精度达到了最先进的水平,一直吸引着研究人员将其用于医学成像领域的疾病诊断。该方法使用基于CNN的固定特征提取技术(一种深度迁移学习方法)对乳房肿块进行二值分类。乳房x线摄影图像来自三个公开可用的数据集,即乳房x线摄影图像分析协会(MIAS),乳房x线摄影筛查数字数据库(DDSM)和Inbreast。预训练的CNN模型:VGG16, VGG19和Resnet-50从乳房x光图像中提取特征。然后使用Random Forest机器学习分类器将从CNN模型中提取的特征分类为恶性和良性肿块。模型的性能用性能矩阵(灵敏度、特异性、f1评分和准确性)、混淆矩阵和ROC(受试者工作特征)曲线进行总结。结合预训练模型、VGG16/VGG19/Resnet-50和RF分类器,MIAS数据集的模型精度分别为0.81%、0.80%、0.83%,DDSM数据集的模型精度分别为0.994%、0.986%、0.996%,Inbreast数据集的模型精度分别为0.83%、0.81%、0.87%。从乳房x光摄影图像中对乳房肿块进行自动分类是一种快速有效的乳腺癌筛查方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信