Generating radiology reports via auxiliary signal guidance and a memory-driven network

IF 7.5 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Expert Systems with Applications Pub Date : 2023-09-09 DOI:10.1016/j.eswa.2023.121260

Youyuan Xue , Yun Tan , Ling Tan , Jiaohua Qin , Xuyu Xiang

{"title":"Generating radiology reports via auxiliary signal guidance and a memory-driven network","authors":"Youyuan Xue , Yun Tan , Ling Tan , Jiaohua Qin , Xuyu Xiang","doi":"10.1016/j.eswa.2023.121260","DOIUrl":null,"url":null,"abstract":"<div><p><span><span><span>Automatically generating medical image reports is a gratifying task. For doctors, it can reduce the heavy burden of writing reports, and for patients, it can reduce the waiting time for reports; it can also avoid misdiagnosis and missed diagnoses caused by human factors. However, this task still faces enormous challenges due to the problem of visual and textual data bias and the complex relationships among the components of medical reports. To this end, in this work, we propose an auxiliary signal guidance and memory-driven (ASGMD) network that can be used to generate medical reports automatically. It includes three modules: an Auxiliary Signal Guidance Module (ASG), a text sequential </span>attention mechanism (TSAM) module, and a Memory Mechanism-Driven Decoding Module (MMDD). Given a medical image of a patient, </span>radiologists<span> usually focus on the abnormal area first, then browse the global information included in the image and write a corresponding report. Similar to the above working mode, the ASG module enhances the features of the abnormal areas of medical images by introducing auxiliary signals that alleviate the problem of visual data bias. We design a novel TSAM module that explores the consistency of medical report context and enhances essential medical information in reports to reduce textual data bias. Finally, the MMDD module integrates visual and textual knowledge to achieve dynamic decoding and generate a final report. The experimental results show that the proposed method outperforms state-of-the-art models on various evaluation metrics on the two public datasets, IU-Xray and MIMIC-CXR. To make our results reproducible, our code has been released at </span></span><span>https://github.com/shangchengLu/ASGMDN</span><svg><path></path></svg>.</p></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"237 ","pages":"Article 121260"},"PeriodicalIF":7.5000,"publicationDate":"2023-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417423017621","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 1

Abstract

Automatically generating medical image reports is a gratifying task. For doctors, it can reduce the heavy burden of writing reports, and for patients, it can reduce the waiting time for reports; it can also avoid misdiagnosis and missed diagnoses caused by human factors. However, this task still faces enormous challenges due to the problem of visual and textual data bias and the complex relationships among the components of medical reports. To this end, in this work, we propose an auxiliary signal guidance and memory-driven (ASGMD) network that can be used to generate medical reports automatically. It includes three modules: an Auxiliary Signal Guidance Module (ASG), a text sequential attention mechanism (TSAM) module, and a Memory Mechanism-Driven Decoding Module (MMDD). Given a medical image of a patient, radiologists usually focus on the abnormal area first, then browse the global information included in the image and write a corresponding report. Similar to the above working mode, the ASG module enhances the features of the abnormal areas of medical images by introducing auxiliary signals that alleviate the problem of visual data bias. We design a novel TSAM module that explores the consistency of medical report context and enhances essential medical information in reports to reduce textual data bias. Finally, the MMDD module integrates visual and textual knowledge to achieve dynamic decoding and generate a final report. The experimental results show that the proposed method outperforms state-of-the-art models on various evaluation metrics on the two public datasets, IU-Xray and MIMIC-CXR. To make our results reproducible, our code has been released at https://github.com/shangchengLu/ASGMDN.

查看原文本刊更多论文

通过辅助信号引导和记忆驱动网络生成放射学报告

自动生成医学图像报告是一项令人满意的任务。对于医生来说，它可以减轻撰写报告的沉重负担，对于患者来说，它也可以减少等待报告的时间；它还可以避免人为因素造成的误诊和漏诊。然而，由于视觉和文本数据偏见的问题以及医疗报告组成部分之间的复杂关系，这项任务仍然面临巨大挑战。为此，在这项工作中，我们提出了一种辅助信号引导和记忆驱动（ASGMD）网络，可用于自动生成医疗报告。它包括三个模块：辅助信号引导模块（ASG）、文本顺序注意力机制（TSAM）模块和记忆机制驱动解码模块（MMDD）。给定患者的医学图像，放射科医生通常首先关注异常区域，然后浏览图像中包含的全局信息并编写相应的报告。与上述工作模式类似，ASG模块通过引入辅助信号来增强医学图像异常区域的特征，从而缓解视觉数据偏差的问题。我们设计了一个新的TSAM模块，该模块探索医疗报告上下文的一致性，并增强报告中的基本医疗信息，以减少文本数据偏差。最后，MMDD模块集成了视觉和文本知识，实现了动态解码并生成最终报告。实验结果表明，在IU-Xray和MIMIC-CXR两个公共数据集上，该方法在各种评估指标上都优于现有模型。为了使我们的结果具有可复制性，我们的代码已在https://github.com/shangchengLu/ASGMDN.

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Expert Systems with Applications 工程技术-工程：电子与电气

CiteScore

13.80

自引率

10.60%

发文量

2045

审稿时长

8.7 months

期刊介绍： Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.