Multi-modal learning methods in medical imaging area: A survey

IF 2.9 3区工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC

Digital Signal Processing Pub Date : 2025-07-04 DOI:10.1016/j.dsp.2025.105441

Yibo Sun, Weitong Chen, Zhe Sun

{"title":"Multi-modal learning methods in medical imaging area: A survey","authors":"Yibo Sun, Weitong Chen, Zhe Sun","doi":"10.1016/j.dsp.2025.105441","DOIUrl":null,"url":null,"abstract":"<div><div>Multi-modal learning is an important branch in the field of deep learning area, which has been widely used for processing data from different media. The fusion of different modalities in natural images has shown significant results, but less attention has been paid to medical images of individual modalities due to data scarcity. The discussion of applications of multi-modal learning has raised great interest in the medical field, including general fusion methods, deep learning-based methods, and large language model-based methods. With the aim of describing the evolution of different models in the field of multi-modal medical imaging, this survey provides a thorough overview of representative methods and related applications. In this study, we first introduced the concept of modality and the development of multi-modal learning, then listed the commonly used medical modalities and fusion strategies. After that, we described the branches of multi-modal models in the medical imaging field in detail, along with various application scenarios and open datasets. We hope our survey will provide guidance for readers to understand typical models and the growing trend within the medical imaging domain.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"167 ","pages":"Article 105441"},"PeriodicalIF":2.9000,"publicationDate":"2025-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1051200425004634","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

Abstract

Multi-modal learning is an important branch in the field of deep learning area, which has been widely used for processing data from different media. The fusion of different modalities in natural images has shown significant results, but less attention has been paid to medical images of individual modalities due to data scarcity. The discussion of applications of multi-modal learning has raised great interest in the medical field, including general fusion methods, deep learning-based methods, and large language model-based methods. With the aim of describing the evolution of different models in the field of multi-modal medical imaging, this survey provides a thorough overview of representative methods and related applications. In this study, we first introduced the concept of modality and the development of multi-modal learning, then listed the commonly used medical modalities and fusion strategies. After that, we described the branches of multi-modal models in the medical imaging field in detail, along with various application scenarios and open datasets. We hope our survey will provide guidance for readers to understand typical models and the growing trend within the medical imaging domain.

查看原文本刊更多论文

医学影像领域多模式学习方法综述

多模态学习是深度学习领域的一个重要分支，已被广泛用于处理不同媒介的数据。自然图像中不同模态的融合已经显示出显著的效果，但由于数据缺乏，对单个模态的医学图像的关注较少。多模态学习的应用讨论引起了医学领域的极大兴趣，包括通用融合方法、基于深度学习的方法和基于大语言模型的方法。为了描述多模态医学成像领域中不同模型的发展，本调查提供了代表性方法及其相关应用的全面概述。在本研究中，我们首先介绍了模态的概念和多模态学习的发展，然后列出了常用的医学模态和融合策略。然后，我们详细描述了多模态模型在医学成像领域的分支，以及各种应用场景和开放数据集。我们希望我们的调查能够为读者了解医学成像领域的典型模型和发展趋势提供指导。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Digital Signal Processing 工程技术-工程：电子与电气

CiteScore

5.30

自引率

17.20%

发文量

435

审稿时长

66 days

期刊介绍： Digital Signal Processing: A Review Journal is one of the oldest and most established journals in the field of signal processing yet it aims to be the most innovative. The Journal invites top quality research articles at the frontiers of research in all aspects of signal processing. Our objective is to provide a platform for the publication of ground-breaking research in signal processing with both academic and industrial appeal. The journal has a special emphasis on statistical signal processing methodology such as Bayesian signal processing, and encourages articles on emerging applications of signal processing such as: • big data• machine learning• internet of things• information security• systems biology and computational biology,• financial time series analysis,• autonomous vehicles,• quantum computing,• neuromorphic engineering,• human-computer interaction and intelligent user interfaces,• environmental signal processing,• geophysical signal processing including seismic signal processing,• chemioinformatics and bioinformatics,• audio, visual and performance arts,• disaster management and prevention,• renewable energy,