{"title":"Multi-modal Disease Prediction with Hierarchical Self-supervised Learning.","authors":"Zhe Qu, Taihua Chen, Xin Zhou, Fanglin Zhu, Wei Guo, Yonghui Xu, Yixin Zhang, Lizhen Cui","doi":"10.1109/JBHI.2025.3561546","DOIUrl":null,"url":null,"abstract":"<p><p>The proliferation of healthcare data sources, including diverse imaging modalities and biochemical measurements, has created unprecedented opportunities for comprehensive disease prediction. Multi-modal clinical data, encompassing medical imaging reports, biochemical assays, and longitudinal clinical records, provides a rich foundation for developing sophisticated diagnostic models. Graph Neural Networks (GNNs) have emerged as a leading methodological framework, distinguished by their capacity to model complex inter-patient relationships and capture community structures within patient data. Despite their promise, current GNN-based approaches exhibit limitations in handling noisy, low-quality data and often impose overly restrictive graph smoothness constraints. These limitations can obscure patient-specific variations and compromise model robustness. To overcome these challenges, we propose HierSSL (Hierarchical Self-Supervised Learning), a novel multi-modal disease prediction framework that enhances representational learning through dual-scale self-supervision mechanisms operating at both local and global levels. HierSSL's architecture specifically addresses two critical aspects: 1) the capture of local inter-modality dependencies and global community patterns, and 2) the optimization of multi-modal feature integration through an innovative combination of feature consistency constraints and graph contrastive learning. Empirical evaluation across two distinct disease prediction datasets demonstrates that HierSSL achieves statistically significant performance improvements compared to state-of-the-art methods, highlighting its efficacy in robust multi-modal data integration for disease prediction tasks.</p>","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"PP ","pages":""},"PeriodicalIF":6.7000,"publicationDate":"2025-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal of Biomedical and Health Informatics","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1109/JBHI.2025.3561546","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
The proliferation of healthcare data sources, including diverse imaging modalities and biochemical measurements, has created unprecedented opportunities for comprehensive disease prediction. Multi-modal clinical data, encompassing medical imaging reports, biochemical assays, and longitudinal clinical records, provides a rich foundation for developing sophisticated diagnostic models. Graph Neural Networks (GNNs) have emerged as a leading methodological framework, distinguished by their capacity to model complex inter-patient relationships and capture community structures within patient data. Despite their promise, current GNN-based approaches exhibit limitations in handling noisy, low-quality data and often impose overly restrictive graph smoothness constraints. These limitations can obscure patient-specific variations and compromise model robustness. To overcome these challenges, we propose HierSSL (Hierarchical Self-Supervised Learning), a novel multi-modal disease prediction framework that enhances representational learning through dual-scale self-supervision mechanisms operating at both local and global levels. HierSSL's architecture specifically addresses two critical aspects: 1) the capture of local inter-modality dependencies and global community patterns, and 2) the optimization of multi-modal feature integration through an innovative combination of feature consistency constraints and graph contrastive learning. Empirical evaluation across two distinct disease prediction datasets demonstrates that HierSSL achieves statistically significant performance improvements compared to state-of-the-art methods, highlighting its efficacy in robust multi-modal data integration for disease prediction tasks.
期刊介绍:
IEEE Journal of Biomedical and Health Informatics publishes original papers presenting recent advances where information and communication technologies intersect with health, healthcare, life sciences, and biomedicine. Topics include acquisition, transmission, storage, retrieval, management, and analysis of biomedical and health information. The journal covers applications of information technologies in healthcare, patient monitoring, preventive care, early disease diagnosis, therapy discovery, and personalized treatment protocols. It explores electronic medical and health records, clinical information systems, decision support systems, medical and biological imaging informatics, wearable systems, body area/sensor networks, and more. Integration-related topics like interoperability, evidence-based medicine, and secure patient data are also addressed.