{"title":"Multimodal survival analysis using optimal transport matching and global-local feature fusion","authors":"Bowen Sun, Yanjun Peng, Yanglei Ge","doi":"10.1016/j.dsp.2025.105119","DOIUrl":null,"url":null,"abstract":"<div><div>The multimodal survival analysis aims to predict patient's mortality risk using multiple modality data. Unlike unimodal task, the essence of multimodal survival prediction is to efficiently integrate the information of different modalities to make more accurate judgments. Despite recent advancements, there are still some serious problems waiting to be properly addressed, such as: (1) there are huge structural differences between gigapixel pathological images and radiological or genetic data. (2) existing multiple instance methods mainly focus on local feature representation, often neglecting global features such as tissue spatial information. In order to reasonably solve these problems, we propose a multimodal survival framework, called the optimal transport (OT) matching and global-local feature fusion (OTGL) framework. Specifically, we first use specially crafted encoders for extracting the class tokens and instance tokens from different data. Then, by applying the optimal transport to the instance tokens of different modalities, the weighted features can be obtained. After that, we perform feature fusion for class tokens and the processed local features to derive the final features that can be used to predict risk score. And we use hybrid loss function to train our OTGL and apply it to pathology-radiomic and pathology-genomic cancer datasets, many experiments demonstrate that the proposed OTGL have better performance than existing state-of-the-art methods. In practical medical settings, our model can aid clinicians in identifying high-risk patients and personalizing treatment plans. The source code will be made available at <span><span>https://github.com/2018213444/OTGLmodel</span><svg><path></path></svg></span> upon acceptance of the manuscript for publication.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"161 ","pages":"Article 105119"},"PeriodicalIF":2.9000,"publicationDate":"2025-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1051200425001411","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
The multimodal survival analysis aims to predict patient's mortality risk using multiple modality data. Unlike unimodal task, the essence of multimodal survival prediction is to efficiently integrate the information of different modalities to make more accurate judgments. Despite recent advancements, there are still some serious problems waiting to be properly addressed, such as: (1) there are huge structural differences between gigapixel pathological images and radiological or genetic data. (2) existing multiple instance methods mainly focus on local feature representation, often neglecting global features such as tissue spatial information. In order to reasonably solve these problems, we propose a multimodal survival framework, called the optimal transport (OT) matching and global-local feature fusion (OTGL) framework. Specifically, we first use specially crafted encoders for extracting the class tokens and instance tokens from different data. Then, by applying the optimal transport to the instance tokens of different modalities, the weighted features can be obtained. After that, we perform feature fusion for class tokens and the processed local features to derive the final features that can be used to predict risk score. And we use hybrid loss function to train our OTGL and apply it to pathology-radiomic and pathology-genomic cancer datasets, many experiments demonstrate that the proposed OTGL have better performance than existing state-of-the-art methods. In practical medical settings, our model can aid clinicians in identifying high-risk patients and personalizing treatment plans. The source code will be made available at https://github.com/2018213444/OTGLmodel upon acceptance of the manuscript for publication.
期刊介绍:
Digital Signal Processing: A Review Journal is one of the oldest and most established journals in the field of signal processing yet it aims to be the most innovative. The Journal invites top quality research articles at the frontiers of research in all aspects of signal processing. Our objective is to provide a platform for the publication of ground-breaking research in signal processing with both academic and industrial appeal.
The journal has a special emphasis on statistical signal processing methodology such as Bayesian signal processing, and encourages articles on emerging applications of signal processing such as:
• big data• machine learning• internet of things• information security• systems biology and computational biology,• financial time series analysis,• autonomous vehicles,• quantum computing,• neuromorphic engineering,• human-computer interaction and intelligent user interfaces,• environmental signal processing,• geophysical signal processing including seismic signal processing,• chemioinformatics and bioinformatics,• audio, visual and performance arts,• disaster management and prevention,• renewable energy,