Raquel Ochoa-Ornelas , Alberto Gudiño-Ochoa , Julio Alberto García-Rodríguez , Sofia Uribe-Toscano
{"title":"使用EfficientNetB3进行肺癌和结肠癌组织病理学图像检测的稳健迁移学习方法","authors":"Raquel Ochoa-Ornelas , Alberto Gudiño-Ochoa , Julio Alberto García-Rodríguez , Sofia Uribe-Toscano","doi":"10.1016/j.health.2025.100391","DOIUrl":null,"url":null,"abstract":"<div><div>Lung and colon cancers are among the deadliest diseases worldwide, necessitating early and accurate detection to improve patient outcomes. This study utilizes the EfficientNetB3 model, a state-of-the-art transfer learning approach, to enhance the detection of colon and lung cancers from histopathological images. The research leverages the LC25000 dataset, comprising 25,000 histopathological images evenly distributed across five classes: colon adenocarcinoma, benign colon tissue, lung adenocarcinoma, lung squamous cell carcinoma, and benign lung tissue. The EfficientNetB3 model initially achieved an impressive accuracy of 99.39% across all classes. To further validate and enhance the model’s robustness and generalizability, we augmented the dataset by replacing 1,000 cancerous class images with new Genomic Data Commons (GDC) Data Portal - National Cancer Institute images, simulating more diverse clinical scenarios. This modification resulted in an accuracy of 99.39%, with equally high performance across other metrics, including precision, recall, and F1-Score, all reaching 99.39%, and a Matthew’s Correlation Coefficient (MCC) of 99.24%. The Gradient-weighted Class Activation Mapping (Grad-CAM) technique was utilized to visually interpret the model’s decisions, enhancing its transparency and reliability. These findings demonstrate that EfficientNetB3 is an effective and generalizable end-to-end framework for histopathological image analysis with minimal preprocessing. The promising results underscore the potential of EfficientNetB3 to advance automated cancer detection, thereby contributing to earlier diagnosis and more effective treatment strategies.</div></div>","PeriodicalId":73222,"journal":{"name":"Healthcare analytics (New York, N.Y.)","volume":"7 ","pages":"Article 100391"},"PeriodicalIF":0.0000,"publicationDate":"2025-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A robust transfer learning approach with histopathological images for lung and colon cancer detection using EfficientNetB3\",\"authors\":\"Raquel Ochoa-Ornelas , Alberto Gudiño-Ochoa , Julio Alberto García-Rodríguez , Sofia Uribe-Toscano\",\"doi\":\"10.1016/j.health.2025.100391\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Lung and colon cancers are among the deadliest diseases worldwide, necessitating early and accurate detection to improve patient outcomes. This study utilizes the EfficientNetB3 model, a state-of-the-art transfer learning approach, to enhance the detection of colon and lung cancers from histopathological images. The research leverages the LC25000 dataset, comprising 25,000 histopathological images evenly distributed across five classes: colon adenocarcinoma, benign colon tissue, lung adenocarcinoma, lung squamous cell carcinoma, and benign lung tissue. The EfficientNetB3 model initially achieved an impressive accuracy of 99.39% across all classes. To further validate and enhance the model’s robustness and generalizability, we augmented the dataset by replacing 1,000 cancerous class images with new Genomic Data Commons (GDC) Data Portal - National Cancer Institute images, simulating more diverse clinical scenarios. This modification resulted in an accuracy of 99.39%, with equally high performance across other metrics, including precision, recall, and F1-Score, all reaching 99.39%, and a Matthew’s Correlation Coefficient (MCC) of 99.24%. The Gradient-weighted Class Activation Mapping (Grad-CAM) technique was utilized to visually interpret the model’s decisions, enhancing its transparency and reliability. These findings demonstrate that EfficientNetB3 is an effective and generalizable end-to-end framework for histopathological image analysis with minimal preprocessing. The promising results underscore the potential of EfficientNetB3 to advance automated cancer detection, thereby contributing to earlier diagnosis and more effective treatment strategies.</div></div>\",\"PeriodicalId\":73222,\"journal\":{\"name\":\"Healthcare analytics (New York, N.Y.)\",\"volume\":\"7 \",\"pages\":\"Article 100391\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-04-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Healthcare analytics (New York, N.Y.)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2772442525000103\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Healthcare analytics (New York, N.Y.)","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772442525000103","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
肺癌和结肠癌是世界上最致命的疾病之一,必须及早准确地发现,以改善患者的预后。本研究利用最先进的迁移学习方法——EfficientNetB3模型,从组织病理学图像中增强结肠癌和肺癌的检测。该研究利用LC25000数据集,包括25000张组织病理学图像,均匀分布在5类:结肠腺癌、良性结肠组织、肺腺癌、肺鳞状细胞癌和良性肺组织。effentnetb3模型最初在所有类中实现了令人印象深刻的99.39%的准确率。为了进一步验证和增强模型的鲁棒性和泛化性,我们通过用新的基因组数据共享(GDC)数据门户-国家癌症研究所图像替换1000个癌症类图像来增强数据集,模拟更多样化的临床场景。这种修改导致准确率达到99.39%,在其他指标上表现同样优异,包括精度,召回率和F1-Score,均达到99.39%,马修相关系数(MCC)为99.24%。利用梯度加权类激活映射(Gradient-weighted Class Activation Mapping, Grad-CAM)技术对模型的决策进行可视化解释,提高了模型的透明度和可靠性。这些发现表明,EfficientNetB3是一种有效的、可推广的端到端组织病理学图像分析框架,只需最少的预处理。这些令人鼓舞的结果强调了EfficientNetB3在推进自动化癌症检测方面的潜力,从而有助于早期诊断和更有效的治疗策略。
A robust transfer learning approach with histopathological images for lung and colon cancer detection using EfficientNetB3
Lung and colon cancers are among the deadliest diseases worldwide, necessitating early and accurate detection to improve patient outcomes. This study utilizes the EfficientNetB3 model, a state-of-the-art transfer learning approach, to enhance the detection of colon and lung cancers from histopathological images. The research leverages the LC25000 dataset, comprising 25,000 histopathological images evenly distributed across five classes: colon adenocarcinoma, benign colon tissue, lung adenocarcinoma, lung squamous cell carcinoma, and benign lung tissue. The EfficientNetB3 model initially achieved an impressive accuracy of 99.39% across all classes. To further validate and enhance the model’s robustness and generalizability, we augmented the dataset by replacing 1,000 cancerous class images with new Genomic Data Commons (GDC) Data Portal - National Cancer Institute images, simulating more diverse clinical scenarios. This modification resulted in an accuracy of 99.39%, with equally high performance across other metrics, including precision, recall, and F1-Score, all reaching 99.39%, and a Matthew’s Correlation Coefficient (MCC) of 99.24%. The Gradient-weighted Class Activation Mapping (Grad-CAM) technique was utilized to visually interpret the model’s decisions, enhancing its transparency and reliability. These findings demonstrate that EfficientNetB3 is an effective and generalizable end-to-end framework for histopathological image analysis with minimal preprocessing. The promising results underscore the potential of EfficientNetB3 to advance automated cancer detection, thereby contributing to earlier diagnosis and more effective treatment strategies.