Improved Breast Cancer Diagnosis through Transfer Learning on Hematoxylin and Eosin Stained Histology Images

Fahad Ahmed, Reem Abdel-Salam, Leon Hamnett, Mary Adewunmi, Temitope Ayano
{"title":"Improved Breast Cancer Diagnosis through Transfer Learning on Hematoxylin and Eosin Stained Histology Images","authors":"Fahad Ahmed, Reem Abdel-Salam, Leon Hamnett, Mary Adewunmi, Temitope Ayano","doi":"arxiv-2309.08745","DOIUrl":null,"url":null,"abstract":"Breast cancer is one of the leading causes of death for women worldwide.\nEarly screening is essential for early identification, but the chance of\nsurvival declines as the cancer progresses into advanced stages. For this\nstudy, the most recent BRACS dataset of histological (H\\&E) stained images was\nused to classify breast cancer tumours, which contains both the whole-slide\nimages (WSI) and region-of-interest (ROI) images, however, for our study we\nhave considered ROI images. We have experimented using different pre-trained\ndeep learning models, such as Xception, EfficientNet, ResNet50, and\nInceptionResNet, pre-trained on the ImageNet weights. We pre-processed the\nBRACS ROI along with image augmentation, upsampling, and dataset split\nstrategies. For the default dataset split, the best results were obtained by\nResNet50 achieving 66\\% f1-score. For the custom dataset split, the best\nresults were obtained by performing upsampling and image augmentation which\nresults in 96.2\\% f1-score. Our second approach also reduced the number of\nfalse positive and false negative classifications to less than 3\\% for each\nclass. We believe that our study significantly impacts the early diagnosis and\nidentification of breast cancer tumors and their subtypes, especially atypical\nand malignant tumors, thus improving patient outcomes and reducing patient\nmortality rates. Overall, this study has primarily focused on identifying seven\n(7) breast cancer tumor subtypes, and we believe that the experimental models\ncan be fine-tuned further to generalize over previous breast cancer histology\ndatasets as well.","PeriodicalId":501321,"journal":{"name":"arXiv - QuanBio - Cell Behavior","volume":"6 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Cell Behavior","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2309.08745","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Breast cancer is one of the leading causes of death for women worldwide. Early screening is essential for early identification, but the chance of survival declines as the cancer progresses into advanced stages. For this study, the most recent BRACS dataset of histological (H\&E) stained images was used to classify breast cancer tumours, which contains both the whole-slide images (WSI) and region-of-interest (ROI) images, however, for our study we have considered ROI images. We have experimented using different pre-trained deep learning models, such as Xception, EfficientNet, ResNet50, and InceptionResNet, pre-trained on the ImageNet weights. We pre-processed the BRACS ROI along with image augmentation, upsampling, and dataset split strategies. For the default dataset split, the best results were obtained by ResNet50 achieving 66\% f1-score. For the custom dataset split, the best results were obtained by performing upsampling and image augmentation which results in 96.2\% f1-score. Our second approach also reduced the number of false positive and false negative classifications to less than 3\% for each class. We believe that our study significantly impacts the early diagnosis and identification of breast cancer tumors and their subtypes, especially atypical and malignant tumors, thus improving patient outcomes and reducing patient mortality rates. Overall, this study has primarily focused on identifying seven (7) breast cancer tumor subtypes, and we believe that the experimental models can be fine-tuned further to generalize over previous breast cancer histology datasets as well.
苏木精和伊红染色组织学图像迁移学习提高乳腺癌诊断
乳腺癌是全世界妇女死亡的主要原因之一。早期筛查对于早期识别至关重要,但随着癌症进展到晚期,存活的机会就会下降。在本研究中,最新的BRACS组织学(H\&E)染色图像数据集用于对乳腺癌肿瘤进行分类,其中包括全幻灯片图像(WSI)和感兴趣区域(ROI)图像,然而,在我们的研究中,我们考虑了ROI图像。我们尝试使用不同的预训练深度学习模型,如Xception、EfficientNet、ResNet50和inceptionresnet,在ImageNet权重上进行预训练。我们预处理了bracs ROI以及图像增强、上采样和数据集分割策略。对于默认的数据集分割,resnet50获得了最好的结果,达到66\% f1-score。对于自定义数据集分割,通过执行上采样和图像增强获得最佳结果,其结果为96.2% f1-score。我们的第二种方法还将每个类别的误阳性和误阴性分类数量减少到低于3%。我们认为,我们的研究对乳腺癌肿瘤及其亚型的早期诊断和鉴别,特别是非典型和恶性肿瘤的早期诊断和鉴别,从而改善患者的预后,降低患者的死亡率。总的来说,这项研究主要集中在确定7种乳腺癌肿瘤亚型,我们相信实验模型可以进一步微调,以推广以前的乳腺癌组织学数据集。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信