{"title":"医学影像领域多模式学习方法综述","authors":"Yibo Sun, Weitong Chen, Zhe Sun","doi":"10.1016/j.dsp.2025.105441","DOIUrl":null,"url":null,"abstract":"<div><div>Multi-modal learning is an important branch in the field of deep learning area, which has been widely used for processing data from different media. The fusion of different modalities in natural images has shown significant results, but less attention has been paid to medical images of individual modalities due to data scarcity. The discussion of applications of multi-modal learning has raised great interest in the medical field, including general fusion methods, deep learning-based methods, and large language model-based methods. With the aim of describing the evolution of different models in the field of multi-modal medical imaging, this survey provides a thorough overview of representative methods and related applications. In this study, we first introduced the concept of modality and the development of multi-modal learning, then listed the commonly used medical modalities and fusion strategies. After that, we described the branches of multi-modal models in the medical imaging field in detail, along with various application scenarios and open datasets. We hope our survey will provide guidance for readers to understand typical models and the growing trend within the medical imaging domain.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"167 ","pages":"Article 105441"},"PeriodicalIF":2.9000,"publicationDate":"2025-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-modal learning methods in medical imaging area: A survey\",\"authors\":\"Yibo Sun, Weitong Chen, Zhe Sun\",\"doi\":\"10.1016/j.dsp.2025.105441\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Multi-modal learning is an important branch in the field of deep learning area, which has been widely used for processing data from different media. The fusion of different modalities in natural images has shown significant results, but less attention has been paid to medical images of individual modalities due to data scarcity. The discussion of applications of multi-modal learning has raised great interest in the medical field, including general fusion methods, deep learning-based methods, and large language model-based methods. With the aim of describing the evolution of different models in the field of multi-modal medical imaging, this survey provides a thorough overview of representative methods and related applications. In this study, we first introduced the concept of modality and the development of multi-modal learning, then listed the commonly used medical modalities and fusion strategies. After that, we described the branches of multi-modal models in the medical imaging field in detail, along with various application scenarios and open datasets. We hope our survey will provide guidance for readers to understand typical models and the growing trend within the medical imaging domain.</div></div>\",\"PeriodicalId\":51011,\"journal\":{\"name\":\"Digital Signal Processing\",\"volume\":\"167 \",\"pages\":\"Article 105441\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2025-07-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Digital Signal Processing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1051200425004634\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1051200425004634","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Multi-modal learning methods in medical imaging area: A survey
Multi-modal learning is an important branch in the field of deep learning area, which has been widely used for processing data from different media. The fusion of different modalities in natural images has shown significant results, but less attention has been paid to medical images of individual modalities due to data scarcity. The discussion of applications of multi-modal learning has raised great interest in the medical field, including general fusion methods, deep learning-based methods, and large language model-based methods. With the aim of describing the evolution of different models in the field of multi-modal medical imaging, this survey provides a thorough overview of representative methods and related applications. In this study, we first introduced the concept of modality and the development of multi-modal learning, then listed the commonly used medical modalities and fusion strategies. After that, we described the branches of multi-modal models in the medical imaging field in detail, along with various application scenarios and open datasets. We hope our survey will provide guidance for readers to understand typical models and the growing trend within the medical imaging domain.
期刊介绍:
Digital Signal Processing: A Review Journal is one of the oldest and most established journals in the field of signal processing yet it aims to be the most innovative. The Journal invites top quality research articles at the frontiers of research in all aspects of signal processing. Our objective is to provide a platform for the publication of ground-breaking research in signal processing with both academic and industrial appeal.
The journal has a special emphasis on statistical signal processing methodology such as Bayesian signal processing, and encourages articles on emerging applications of signal processing such as:
• big data• machine learning• internet of things• information security• systems biology and computational biology,• financial time series analysis,• autonomous vehicles,• quantum computing,• neuromorphic engineering,• human-computer interaction and intelligent user interfaces,• environmental signal processing,• geophysical signal processing including seismic signal processing,• chemioinformatics and bioinformatics,• audio, visual and performance arts,• disaster management and prevention,• renewable energy,