Robust Multimodal Fusion for Survival Prediction in Cancer Patients.

IF 2.5 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY
Cancer Informatics Pub Date : 2025-09-27 eCollection Date: 2025-01-01 DOI:10.1177/11769351251376192
Dominic Flack, Aakash Tripathi, Asim Waqas, Ghulam Rasool, Dimah Dera
{"title":"Robust Multimodal Fusion for Survival Prediction in Cancer Patients.","authors":"Dominic Flack, Aakash Tripathi, Asim Waqas, Ghulam Rasool, Dimah Dera","doi":"10.1177/11769351251376192","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>Multimodal deep learning models have the potential to significantly improve survival predictions and treatment planning for cancer patients. These models integrate diverse data modalities using early, intermediate, or late fusion techniques. However, many existing multimodal models either underperform or show only marginal improvements over unimodal models. To establish the true efficacy of multimodal survival prediction models, it is critical to demonstrate consistent and substantial advantages over unimodal counterparts.</p><p><strong>Methods: </strong>In this paper, we introduce the Robust Multimodal Survival Model (RMSurv), a novel discrete late fusion model that leverages synthetic data generation to compute time-dependent weights for various modalities. RMSurv utilizes up to 6 distinct data modalities from The Cancer Genome Atlas Program (TCGA) non-small cell lung cancer and the TCGA pan-cancer datasets to predict overall survival over a period of 10 years. The key innovations of RMSurv are the calculation of time-dependent late fusion weights using a synthetically generated dataset and a new statistical feature normalization technique to enhance the interpretability and accuracy of discrete survival predictions. We evaluate the performance of the proposed method and several alternatives with cross validation using the concordance index, and vary the number of modalities included. We also create a late fusion simulation to highlight the complex relationships of multimodal fusion.</p><p><strong>Results: </strong>In our experiments, RMSurv outperforms the best unimodal model's Concordance index (C-Index) by 0.0273 on the 6-modal TCGA Lung Adenocarcinoma (LUAD) dataset. Existing late and early fusion methods improved the C-index by only 0.0143 and 0.0072, respectively. RMSurv also performs best on the combined TCGA non-small-cell lung cancer dataset and the TCGA pan-cancer dataset.</p><p><strong>Conclusions: </strong>These advancements underscore RMSurv's potential as a powerful approach for survival prediction, establishing robust multimodal benefits, and setting a new benchmark for survival prediction models in pan-cancer settings.</p>","PeriodicalId":35418,"journal":{"name":"Cancer Informatics","volume":"24 ","pages":"11769351251376192"},"PeriodicalIF":2.5000,"publicationDate":"2025-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12476512/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cancer Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/11769351251376192","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Objectives: Multimodal deep learning models have the potential to significantly improve survival predictions and treatment planning for cancer patients. These models integrate diverse data modalities using early, intermediate, or late fusion techniques. However, many existing multimodal models either underperform or show only marginal improvements over unimodal models. To establish the true efficacy of multimodal survival prediction models, it is critical to demonstrate consistent and substantial advantages over unimodal counterparts.

Methods: In this paper, we introduce the Robust Multimodal Survival Model (RMSurv), a novel discrete late fusion model that leverages synthetic data generation to compute time-dependent weights for various modalities. RMSurv utilizes up to 6 distinct data modalities from The Cancer Genome Atlas Program (TCGA) non-small cell lung cancer and the TCGA pan-cancer datasets to predict overall survival over a period of 10 years. The key innovations of RMSurv are the calculation of time-dependent late fusion weights using a synthetically generated dataset and a new statistical feature normalization technique to enhance the interpretability and accuracy of discrete survival predictions. We evaluate the performance of the proposed method and several alternatives with cross validation using the concordance index, and vary the number of modalities included. We also create a late fusion simulation to highlight the complex relationships of multimodal fusion.

Results: In our experiments, RMSurv outperforms the best unimodal model's Concordance index (C-Index) by 0.0273 on the 6-modal TCGA Lung Adenocarcinoma (LUAD) dataset. Existing late and early fusion methods improved the C-index by only 0.0143 and 0.0072, respectively. RMSurv also performs best on the combined TCGA non-small-cell lung cancer dataset and the TCGA pan-cancer dataset.

Conclusions: These advancements underscore RMSurv's potential as a powerful approach for survival prediction, establishing robust multimodal benefits, and setting a new benchmark for survival prediction models in pan-cancer settings.

稳健性多模态融合用于癌症患者生存预测。
目的:多模态深度学习模型具有显著改善癌症患者生存预测和治疗计划的潜力。这些模型使用早期、中期或晚期融合技术集成了不同的数据模式。然而,许多现有的多模态模型要么表现不佳,要么只显示出单模态模型的边际改进。为了建立多模式生存预测模型的真正功效,证明与单模式相比具有一致和实质性的优势是至关重要的。方法:在本文中,我们介绍了鲁棒多模态生存模型(RMSurv),这是一种新颖的离散晚期融合模型,利用合成数据生成来计算各种模态的时间相关权重。RMSurv利用来自癌症基因组图谱计划(TCGA)非小细胞肺癌和TCGA泛癌症数据集的多达6种不同的数据模式来预测10年的总生存期。RMSurv的关键创新是使用合成生成的数据集计算时间相关的晚期融合权重,以及一种新的统计特征归一化技术,以提高离散生存预测的可解释性和准确性。我们评估了所提出的方法和几种替代方案的性能,使用一致性指数进行交叉验证,并改变了所包括的模式的数量。我们还创建了一个后期融合模拟,以突出多模态融合的复杂关系。结果:在我们的实验中,RMSurv在6模态TCGA肺腺癌(LUAD)数据集上比最佳单模态模型的一致性指数(C-Index)高出0.0273。现有的晚期和早期融合方法分别仅提高了0.0143和0.0072的c指数。RMSurv在TCGA非小细胞肺癌数据集和TCGA泛癌症数据集上也表现最佳。结论:这些进展强调了RMSurv作为一种强大的生存预测方法的潜力,建立了强大的多模式益处,并为泛癌症环境下的生存预测模型设定了新的基准。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Cancer Informatics
Cancer Informatics Medicine-Oncology
CiteScore
3.00
自引率
5.00%
发文量
30
审稿时长
8 weeks
期刊介绍: The field of cancer research relies on advances in many other disciplines, including omics technology, mass spectrometry, radio imaging, computer science, and biostatistics. Cancer Informatics provides open access to peer-reviewed high-quality manuscripts reporting bioinformatics analysis of molecular genetics and/or clinical data pertaining to cancer, emphasizing the use of machine learning, artificial intelligence, statistical algorithms, advanced imaging techniques, data visualization, and high-throughput technologies. As the leading journal dedicated exclusively to the report of the use of computational methods in cancer research and practice, Cancer Informatics leverages methodological improvements in systems biology, genomics, proteomics, metabolomics, and molecular biochemistry into the fields of cancer detection, treatment, classification, risk-prediction, prevention, outcome, and modeling.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信