Generating Medical Report via Joint Probability Graph Reasoning

IF 3.5 1区计算机科学 Q1 Multidisciplinary

Tsinghua Science and Technology Pub Date : 2025-03-03 DOI:10.26599/TST.2024.9010058

Junsan Zhang;Ming Cheng;Xiangyang Li;Xiuxuan Shen;Yuxue Liu;Yao Wan

{"title":"Generating Medical Report via Joint Probability Graph Reasoning","authors":"Junsan Zhang;Ming Cheng;Xiangyang Li;Xiuxuan Shen;Yuxue Liu;Yao Wan","doi":"10.26599/TST.2024.9010058","DOIUrl":null,"url":null,"abstract":"In medical X-ray images, multiple abnormalities may occur frequently. However, existing report generation methods cannot efficiently extract all abnormal features, resulting in incomplete disease diagnoses when generating diagnostic reports. In real medical scenarios, there are co-occurrence relations among multiple diseases. If such co-occurrence relations are mined and integrated into the feature extraction process, the issue of missing abnormal features may be addressed. Inspired by this observation, we propose a novel method to improve the extraction of abnormal features in images through joint probability graph reasoning. Specifically, to reveal the co-occurrence relations among multiple diseases, we conduct statistical analyses on the dataset, and extract disease relationships into a probability map. Subsequently, we devise a graph reasoning network for conducting correlation-based reasoning over the features of medical images, which can facilitate the acquisition of more abnormal features. Furthermore, we introduce a gating mechanism focused on cross-modal features fusion into the current text generation model. This optimization substantially improves the model's capabilities to learn and fuse information from two distinct modalities-medical images and texts. Experimental results on the IU-X-Ray and MIMIC-CXR datasets demonstrate that our approach outperforms previous state-of-the-art methods, exhibiting the ability to generate higher quality medical image reports.","PeriodicalId":48690,"journal":{"name":"Tsinghua Science and Technology","volume":"30 4","pages":"1685-1699"},"PeriodicalIF":3.5000,"publicationDate":"2025-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10908661","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Tsinghua Science and Technology","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10908661/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Multidisciplinary","Score":null,"Total":0}

引用次数: 0

Abstract

In medical X-ray images, multiple abnormalities may occur frequently. However, existing report generation methods cannot efficiently extract all abnormal features, resulting in incomplete disease diagnoses when generating diagnostic reports. In real medical scenarios, there are co-occurrence relations among multiple diseases. If such co-occurrence relations are mined and integrated into the feature extraction process, the issue of missing abnormal features may be addressed. Inspired by this observation, we propose a novel method to improve the extraction of abnormal features in images through joint probability graph reasoning. Specifically, to reveal the co-occurrence relations among multiple diseases, we conduct statistical analyses on the dataset, and extract disease relationships into a probability map. Subsequently, we devise a graph reasoning network for conducting correlation-based reasoning over the features of medical images, which can facilitate the acquisition of more abnormal features. Furthermore, we introduce a gating mechanism focused on cross-modal features fusion into the current text generation model. This optimization substantially improves the model's capabilities to learn and fuse information from two distinct modalities-medical images and texts. Experimental results on the IU-X-Ray and MIMIC-CXR datasets demonstrate that our approach outperforms previous state-of-the-art methods, exhibiting the ability to generate higher quality medical image reports.

查看原文本刊更多论文

联合概率图推理生成医疗报告

在医学x线图像中，经常会出现多种异常。然而，现有的报告生成方法不能有效地提取所有异常特征，导致在生成诊断报告时疾病诊断不完整。在真实的医疗场景中，多种疾病之间存在共现关系。如果将这种共现关系挖掘并集成到特征提取过程中，就可以解决缺失异常特征的问题。受此启发，我们提出了一种新的方法，通过联合概率图推理来提高图像中异常特征的提取。具体而言，为了揭示多种疾病之间的共现关系，我们对数据集进行统计分析，并将疾病关系提取成概率图。随后，我们设计了一个图推理网络，对医学图像的特征进行基于相关性的推理，有利于获取更多的异常特征。此外，我们在当前的文本生成模型中引入了一种专注于跨模态特征融合的门控机制。这种优化大大提高了模型从两种不同的模式（医学图像和文本）中学习和融合信息的能力。在IU-X-Ray和MIMIC-CXR数据集上的实验结果表明，我们的方法优于以前最先进的方法，显示出生成更高质量医学图像报告的能力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Tsinghua Science and Technology COMPUTER SCIENCE, INFORMATION SYSTEMSCOMPU-COMPUTER SCIENCE, SOFTWARE ENGINEERING

CiteScore

10.20

自引率

10.60%

发文量

2340

期刊介绍： Tsinghua Science and Technology (Tsinghua Sci Technol) started publication in 1996. It is an international academic journal sponsored by Tsinghua University and is published bimonthly. This journal aims at presenting the up-to-date scientific achievements in computer science, electronic engineering, and other IT fields. Contributions all over the world are welcome.