Qing Li, Zehao Li, Jingjing Song, Jianshuo Bao, Jin Yang, Zhuhong You
{"title":"InKrat: Interpretable diagnosis prediction models based on cross-modal knowledge graph semantic retrieval fusion","authors":"Qing Li, Zehao Li, Jingjing Song, Jianshuo Bao, Jin Yang, Zhuhong You","doi":"10.1016/j.inffus.2025.103546","DOIUrl":null,"url":null,"abstract":"<div><div>Although deep learning models have made breakthrough achievements in many fields, they still face some challenges in diagnostic prediction tasks in healthcare. Existing methods either use graph structures or sequence structures one-sidedly or disjointedly, failing to obtain high-quality representations of EMR data. Some knowledge-enhanced methods rely on strategies based on name or identifier matching, lacking flexibility while introducing semantically mismatched noise. On the other hand, attention-based models for interpretable analysis can only provide the importance of different factors rather than intuitive and easily understandable natural language descriptions. To address the above issues, we propose InKrat, a new KG-enhanced method. Specifically, we designed a novel temporal graph structure that models the structure and temporal information in EMR by integrating anchor nodes as a bridge. We also developed a cross-modal semantic retrieval method, utilizing a large language model (LLM) to compute the semantic similarity between the KG and medical notes, filtering the knowledge accordingly. Finally, based on the knowledge prompts, the LLM generates interpretable descriptions of the prediction results. We have extensively validated the effectiveness of InKrat through experiments on two commonly used real-world datasets. The results demonstrate that our proposed method achieves state-of-the-art performance. Our code can be found at <span><span>https://github.com/lzh-nwpu/InKrat</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50367,"journal":{"name":"Information Fusion","volume":"126 ","pages":"Article 103546"},"PeriodicalIF":15.5000,"publicationDate":"2025-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Fusion","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1566253525006189","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Although deep learning models have made breakthrough achievements in many fields, they still face some challenges in diagnostic prediction tasks in healthcare. Existing methods either use graph structures or sequence structures one-sidedly or disjointedly, failing to obtain high-quality representations of EMR data. Some knowledge-enhanced methods rely on strategies based on name or identifier matching, lacking flexibility while introducing semantically mismatched noise. On the other hand, attention-based models for interpretable analysis can only provide the importance of different factors rather than intuitive and easily understandable natural language descriptions. To address the above issues, we propose InKrat, a new KG-enhanced method. Specifically, we designed a novel temporal graph structure that models the structure and temporal information in EMR by integrating anchor nodes as a bridge. We also developed a cross-modal semantic retrieval method, utilizing a large language model (LLM) to compute the semantic similarity between the KG and medical notes, filtering the knowledge accordingly. Finally, based on the knowledge prompts, the LLM generates interpretable descriptions of the prediction results. We have extensively validated the effectiveness of InKrat through experiments on two commonly used real-world datasets. The results demonstrate that our proposed method achieves state-of-the-art performance. Our code can be found at https://github.com/lzh-nwpu/InKrat.
期刊介绍:
Information Fusion serves as a central platform for showcasing advancements in multi-sensor, multi-source, multi-process information fusion, fostering collaboration among diverse disciplines driving its progress. It is the leading outlet for sharing research and development in this field, focusing on architectures, algorithms, and applications. Papers dealing with fundamental theoretical analyses as well as those demonstrating their application to real-world problems will be welcome.