Enhancing Robust Liver Cancer Diagnosis: A Contrastive Multi-Modality Learner with Lightweight Fusion and Effective Data Augmentation

ACM transactions on computing for healthcare Pub Date : 2023-12-30 DOI:10.1145/3639414

Pei-Xuan Li, Hsun-Ping Hsieh, Chiang Fan Yang, Ding-You Wu, Ching-Chung Ko

引用次数: 0

Abstract

This paper explores the application of self-supervised contrastive learning in the medical domain, focusing on classification of multi-modality Magnetic Resonance (MR) images. To address the challenges of limited and hard-to-annotate medical data, we introduce multi-modality data augmentation (MDA) and cross-modality group convolution (CGC). In the pre-training phase, we leverage Simple Siamese networks to maximize the similarity between two augmented MR images from a patient, without a handcrafted pretext task. Our approach also combines 3D and 2D group convolution with a channel shuffle operation to efficiently incorporate different modalities of image features. Evaluation on liver MR images from a well-known hospital in Taiwan demonstrates a significant improvement over previous methods. This work contributes to advancing multi-modality contrastive learning, particularly in the context of medical imaging, offering enhanced tools for analyzing complex image data.

查看原文本刊更多论文

增强肝癌诊断的鲁棒性：具有轻量级融合和有效数据增强功能的多模态对比学习器

本文探讨了自监督对比学习在医疗领域的应用，重点是多模态磁共振（MR）图像的分类。为了应对医疗数据有限且难以标注的挑战，我们引入了多模态数据增强（MDA）和跨模态群卷积（CGC）。在预训练阶段，我们利用简单连体网络（Simple Siamese networks）来最大化患者两幅增强磁共振图像之间的相似性，而无需手工制作借口任务。我们的方法还将三维和二维群卷积与通道洗牌操作相结合，有效地整合了不同模式的图像特征。在台湾一家知名医院的肝脏磁共振图像上进行的评估表明，我们的方法比以前的方法有了显著的改进。这项工作有助于推进多模态对比学习，尤其是在医学成像方面，为分析复杂图像数据提供了更强大的工具。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

ACM transactions on computing for healthcare

CiteScore

10.30

自引率

0.00%

发文量