{"title":"CAMAF: Context-Aware Multimodal Alignment Framework for explainable lung disease risk stratification","authors":"Subba Rao Dusari, Nagendra Panini Challa","doi":"10.1016/j.eswa.2025.127398","DOIUrl":null,"url":null,"abstract":"<div><div>Lung disease remains a leading cause of morbidity and mortality worldwide, necessitating early and accurate risk stratification to improve patient outcomes. Existing approaches rely on either clinical data or imaging modalities independently, limiting their ability to capture the intricate interactions between structured and unstructured information. To overcome this limitation, we propose the Context-Aware Multimodal Alignment Framework (CAMAF), a novel AI-driven approach that dynamically integrates structured clinical data from the MIMIC-IV dataset and unstructured imaging data from the CheXpert dataset for context-aware and interpretable lung disease risk stratification. Unlike conventional multimodal fusion techniques, CAMAF introduces a Context-Gated Attention (CGA) mechanism, which adaptively aligns modality contributions based on patient-specific contexts. Additionally, CAMAF employs TabTransformer and Vision Transformer (ViT) for high-fidelity feature extraction and leverages a Transformer Encoder for enhanced multimodal fusion, ensuring superior predictive performance. Experimental results demonstrate the efficacy of CAMAF, achieving 92.78% accuracy and 0.9457 AUC-ROC, outperforming state-of-the-art machine learning baselines and traditional clinical risk scores (e.g., CURB-65, BODE Index). Robustness analysis highlights its ability to maintain performance under noisy and missing data conditions, while SHAP and Grad-CAM provide interpretable explanations for clinical decision-making. Despite its advantages, CAMAF faces challenges in handling rare pathologies, which can be addressed through data augmentation and transfer learning. This work establishes CAMAF as a novel, clinically viable AI framework, bridging the gap between multimodal data integration, interpretability, and real-world clinical applicability in lung disease risk stratification.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"279 ","pages":"Article 127398"},"PeriodicalIF":7.5000,"publicationDate":"2025-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425010206","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Lung disease remains a leading cause of morbidity and mortality worldwide, necessitating early and accurate risk stratification to improve patient outcomes. Existing approaches rely on either clinical data or imaging modalities independently, limiting their ability to capture the intricate interactions between structured and unstructured information. To overcome this limitation, we propose the Context-Aware Multimodal Alignment Framework (CAMAF), a novel AI-driven approach that dynamically integrates structured clinical data from the MIMIC-IV dataset and unstructured imaging data from the CheXpert dataset for context-aware and interpretable lung disease risk stratification. Unlike conventional multimodal fusion techniques, CAMAF introduces a Context-Gated Attention (CGA) mechanism, which adaptively aligns modality contributions based on patient-specific contexts. Additionally, CAMAF employs TabTransformer and Vision Transformer (ViT) for high-fidelity feature extraction and leverages a Transformer Encoder for enhanced multimodal fusion, ensuring superior predictive performance. Experimental results demonstrate the efficacy of CAMAF, achieving 92.78% accuracy and 0.9457 AUC-ROC, outperforming state-of-the-art machine learning baselines and traditional clinical risk scores (e.g., CURB-65, BODE Index). Robustness analysis highlights its ability to maintain performance under noisy and missing data conditions, while SHAP and Grad-CAM provide interpretable explanations for clinical decision-making. Despite its advantages, CAMAF faces challenges in handling rare pathologies, which can be addressed through data augmentation and transfer learning. This work establishes CAMAF as a novel, clinically viable AI framework, bridging the gap between multimodal data integration, interpretability, and real-world clinical applicability in lung disease risk stratification.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.