Bokai Yang;Hongyang Lei;Huazhen Huang;Xinxin Han;Yunpeng Cai
{"title":"DPN: Dynamics Priori Networks for Radiology Report Generation","authors":"Bokai Yang;Hongyang Lei;Huazhen Huang;Xinxin Han;Yunpeng Cai","doi":"10.26599/TST.2023.9010134","DOIUrl":null,"url":null,"abstract":"Radiology report generation is of significant importance. Unlike standard image captioning tasks, radiology report generation faces more pronounced visual and textual biases due to constrained data availability, making it increasingly reliant on prior knowledge in this context. In this paper, we introduce a radiology report generation network termed Dynamics Priori Networks (DPN), which leverages a dynamic knowledge graph and prior knowledge. Concretely, we establish an adaptable graph network and harness both medical domain knowledge and expert insights to enhance the model's intelligence. Notably, we introduce an image-text contrastive module and an image-text matching module to enhance the quality of the generated results. Our method is evaluated on two widely available datasets: X-ray collection from Indiana University (IU X-ray) and Medical Information Mart for Intensive Care, Chest X-Ray (MIMIC-CXR), where it demonstrates superior performance, particularly excelling in critical metrics.","PeriodicalId":48690,"journal":{"name":"Tsinghua Science and Technology","volume":"30 2","pages":"600-609"},"PeriodicalIF":6.6000,"publicationDate":"2024-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10786932","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Tsinghua Science and Technology","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10786932/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Multidisciplinary","Score":null,"Total":0}
引用次数: 0
Abstract
Radiology report generation is of significant importance. Unlike standard image captioning tasks, radiology report generation faces more pronounced visual and textual biases due to constrained data availability, making it increasingly reliant on prior knowledge in this context. In this paper, we introduce a radiology report generation network termed Dynamics Priori Networks (DPN), which leverages a dynamic knowledge graph and prior knowledge. Concretely, we establish an adaptable graph network and harness both medical domain knowledge and expert insights to enhance the model's intelligence. Notably, we introduce an image-text contrastive module and an image-text matching module to enhance the quality of the generated results. Our method is evaluated on two widely available datasets: X-ray collection from Indiana University (IU X-ray) and Medical Information Mart for Intensive Care, Chest X-Ray (MIMIC-CXR), where it demonstrates superior performance, particularly excelling in critical metrics.
放射学报告生成非常重要。与标准的图像标题任务不同,由于数据可用性的限制,放射学报告生成面临着更明显的视觉和文本偏差,因此在这种情况下越来越依赖于先验知识。在本文中,我们介绍了一种称为动态先验网络(DPN)的放射学报告生成网络,它利用了动态知识图谱和先验知识。具体来说,我们建立了一个可适应的图网络,并利用医学领域知识和专家见解来增强模型的智能性。值得注意的是,我们引入了图像-文本对比模块和图像-文本匹配模块,以提高生成结果的质量。我们的方法在两个广泛可用的数据集上进行了评估:我们的方法在两个广泛使用的数据集上进行了评估:印第安纳大学的 X 射线集(IU X-ray)和重症监护医学信息中心的胸部 X 射线集(MIMIC-CXR),在这两个数据集上,我们的方法表现出了卓越的性能,尤其是在关键指标方面。
期刊介绍:
Tsinghua Science and Technology (Tsinghua Sci Technol) started publication in 1996. It is an international academic journal sponsored by Tsinghua University and is published bimonthly. This journal aims at presenting the up-to-date scientific achievements in computer science, electronic engineering, and other IT fields. Contributions all over the world are welcome.