MTECC: A Multitask Learning Framework for Esophageal Cancer Analysis

IEEE transactions on artificial intelligence Pub Date : 2024-10-24 DOI:10.1109/TAI.2024.3485524

Jianpeng An;Wenqi Li;Yunhao Bai;Huazhen Chen;Gang Zhao;Qing Cai;Zhongke Gao

{"title":"MTECC: A Multitask Learning Framework for Esophageal Cancer Analysis","authors":"Jianpeng An;Wenqi Li;Yunhao Bai;Huazhen Chen;Gang Zhao;Qing Cai;Zhongke Gao","doi":"10.1109/TAI.2024.3485524","DOIUrl":null,"url":null,"abstract":"In the field of esophageal cancer diagnostics, the accurate identification and classification of tumors and adjacent tissues within whole slide images (WSIs) are critical. However, this task is complicated by the difficulty in annotating normal tissue on tumor-bearing slides, as the infiltration results in a blend of different tissue types, making annotation difficult for pathologists. To overcome this challenge, we introduce the multitask esophageal cancer classification (MTECC) framework, featuring an innovative dual-branch architecture that operates at both global and local levels. The framework initially employs a masked autoencoder (MAE) for self-supervised learning. A distinctive feature of MTECC is the integration of RandoMix, an innovative image augmentation technique that randomly exchanges patches between different images. This method significantly enhances the model's generalization ability, especially for recognizing tissues within cancerous slides. MTECC ingeniously integrates two tasks: tumor detection using global tokens, and fine-grained tissue classification at the patch level using local tokens. The empirical evaluation of the MTECC on our extensive esophageal cancer dataset substantiates its efficacy. The performance metrics indicate robust results, with an accuracy of 0.811, an F1 score of 0.735, and an AUC of 0.957. The MTECC method represents a significant advancement in applying deep learning to complex pathological image analysis, offering valuable tools for pathologists in diagnosing and treating esophageal cancer.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 12","pages":"6739-6751"},"PeriodicalIF":0.0000,"publicationDate":"2024-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on artificial intelligence","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10734152/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

In the field of esophageal cancer diagnostics, the accurate identification and classification of tumors and adjacent tissues within whole slide images (WSIs) are critical. However, this task is complicated by the difficulty in annotating normal tissue on tumor-bearing slides, as the infiltration results in a blend of different tissue types, making annotation difficult for pathologists. To overcome this challenge, we introduce the multitask esophageal cancer classification (MTECC) framework, featuring an innovative dual-branch architecture that operates at both global and local levels. The framework initially employs a masked autoencoder (MAE) for self-supervised learning. A distinctive feature of MTECC is the integration of RandoMix, an innovative image augmentation technique that randomly exchanges patches between different images. This method significantly enhances the model's generalization ability, especially for recognizing tissues within cancerous slides. MTECC ingeniously integrates two tasks: tumor detection using global tokens, and fine-grained tissue classification at the patch level using local tokens. The empirical evaluation of the MTECC on our extensive esophageal cancer dataset substantiates its efficacy. The performance metrics indicate robust results, with an accuracy of 0.811, an F1 score of 0.735, and an AUC of 0.957. The MTECC method represents a significant advancement in applying deep learning to complex pathological image analysis, offering valuable tools for pathologists in diagnosing and treating esophageal cancer.

查看原文本刊更多论文

食管癌分析的多任务学习框架

在食管癌诊断领域，准确识别和分类全切片图像（WSI）中的肿瘤和邻近组织至关重要。然而，由于肿瘤浸润会导致不同组织类型的混合，病理学家很难对肿瘤载玻片上的正常组织进行标注，从而使这项任务变得复杂。为了克服这一难题，我们推出了多任务食管癌分类（MTECC）框架，该框架采用创新的双分支架构，可在全局和局部两个层面上运行。该框架最初采用掩码自动编码器（MAE）进行自我监督学习。MTECC 的一个显著特点是集成了 RandoMix，这是一种创新的图像增强技术，可在不同图像之间随机交换斑块。这种方法大大增强了模型的泛化能力，尤其是在识别癌变切片中的组织时。MTECC 巧妙地整合了两项任务：使用全局标记检测肿瘤，以及使用局部标记在补丁级进行细粒度组织分类。我们在广泛的食管癌数据集上对 MTECC 进行了实证评估，证实了它的功效。性能指标显示结果很稳定，准确率为 0.811，F1 得分为 0.735，AUC 为 0.957。MTECC 方法代表了将深度学习应用于复杂病理图像分析的重大进展，为病理学家诊断和治疗食管癌提供了宝贵的工具。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE transactions on artificial intelligence

CiteScore

7.70

自引率

0.00%

发文量