Application of transfer learning and ensemble learning in image-level classification for breast histopathology

IF 4.4 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
Yuchao Zheng, Chen Li, Xiaomin Zhou, Haoyuan Chen, Hao Xu, Yixin Li, Haiqing Zhang, Xiaoyan Li, Hongzan Sun, Xinyu Huang, Marcin Grzegorzek
Intelligent Medicine, vol. 3, no. 2, pp. 115-128. Published 2023-05-01. DOI: 10.1016/j.imed.2022.05.004
Source: https://www.sciencedirect.com/science/article/pii/S266710262200047X
Citations: 0

Abstract

Background

Breast cancer has the highest prevalence among all cancers in women globally. The classification of histopathological images in the diagnosis of breast cancers is an area of clinical concern. In computer-aided diagnosis, most traditional classification models use a single network to extract features, although this approach has significant limitations. Moreover, many networks are trained and optimized on patient-level datasets, ignoring lower-level data labels.

Methods

This paper proposes a deep ensemble model based on image-level labels for the binary classification of breast histopathological images as benign or malignant. First, the BreaKHis dataset was randomly divided into training, validation, and test sets. Second, data augmentation techniques were used to balance the numbers of benign and malignant samples. Third, based on their transfer learning performance and the complementarity between networks, VGG16, Xception, ResNet50, and DenseNet201 were selected as base classifiers.
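
The first two steps above (random splitting and class balancing) can be sketched as follows. This is a minimal illustration, not the authors' pipeline: the 70/15/15 split, the seed, the record layout, and the helper names `split_dataset` and `augmentation_plan` are all assumptions, since the abstract does not state them. The sketch only counts how many augmented copies each class would need; the actual image transforms are left out.

```python
import random


def split_dataset(items, train_pct=70, val_pct=15, seed=0):
    """Randomly split items into training, validation, and test lists.

    The percentages are illustrative assumptions; the abstract gives no ratios.
    """
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    n_train = len(shuffled) * train_pct // 100
    n_val = len(shuffled) * val_pct // 100
    train = shuffled[:n_train]
    val = shuffled[n_train:n_train + n_val]
    test = shuffled[n_train + n_val:]
    return train, val, test


def augmentation_plan(labels):
    """Return, per class, how many augmented copies are needed to match the largest class."""
    counts = {}
    for label in labels:
        counts[label] = counts.get(label, 0) + 1
    target = max(counts.values())
    return {label: target - n for label, n in counts.items()}


# Hypothetical image-level records (image_id, label); not real BreaKHis data.
records = [(i, "malignant") for i in range(70)] + [(i, "benign") for i in range(70, 100)]
train, val, test = split_dataset(records)
plan = augmentation_plan([label for _, label in train])
```

Balancing is computed on the training split only, so that augmented copies of an image cannot leak into the validation or test sets.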

Results

In an ensemble network model with accuracy as the weight, the image-level binary classification achieved an accuracy of 98.90%. To verify the capabilities of our method, it was experimentally compared with the latest transformer and multilayer perceptron (MLP) models on the same dataset. Our ensemble model showed a 5% to 20% advantage, emphasizing its far-reaching abilities in classification tasks.
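
One way to read "accuracy as the weight" is a soft vote in which each base classifier's predicted probability is scaled by its normalized accuracy. The sketch below assumes that reading; the abstract does not give the exact formula, and the accuracy values, probabilities, 0.5 threshold, and function names are hypothetical.

```python
def accuracy_weighted_ensemble(probabilities, accuracies):
    """Fuse per-model P(malignant) values using accuracy-proportional weights.

    probabilities: dict of model name -> list of P(malignant), one per image
    accuracies:    dict of model name -> accuracy in [0, 1] (assumed to be
                   measured on the validation set)
    """
    total = sum(accuracies[name] for name in probabilities)
    n_images = len(next(iter(probabilities.values())))
    fused = [0.0] * n_images
    for name, probs in probabilities.items():
        weight = accuracies[name] / total  # weights sum to 1
        for i, p in enumerate(probs):
            fused[i] += weight * p
    return fused


def predict_labels(fused, threshold=0.5):
    """Turn fused probabilities into benign/malignant labels."""
    return ["malignant" if p >= threshold else "benign" for p in fused]


# Hypothetical accuracies and predictions for three images.
val_accuracy = {"VGG16": 0.90, "Xception": 0.92, "ResNet50": 0.94, "DenseNet201": 0.96}
model_probs = {
    "VGG16": [0.80, 0.40, 0.10],
    "Xception": [0.70, 0.60, 0.20],
    "ResNet50": [0.90, 0.55, 0.05],
    "DenseNet201": [0.85, 0.30, 0.15],
}
fused = accuracy_weighted_ensemble(model_probs, val_accuracy)
labels = predict_labels(fused)
```

On the second image the four models disagree (two above 0.5, two below), and the weighting lets the more accurate models tip the fused result.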

Conclusions

This research focuses on improving the performance of a classification model with an ensemble algorithm. Transfer learning plays an essential role in classifying small datasets, improving both training speed and accuracy. Our model may outperform many existing approaches with respect to accuracy and has applications in the field of auxiliary medical diagnosis.

Source journal: Intelligent Medicine
Subject areas: Surgery, Radiology and Imaging, Artificial Intelligence, Biomedical Engineering
CiteScore: 5.20
Self-citation rate: 0.00%
Articles published: 19