Zihao Lin , Zinan Hong , Zijian Zhou , Miaojing Shi , Jin-Gang Yu , Jingping Yun , Shuangping Huang
{"title":"DC-RRG:以诊断为中心的级联放射学报告生成","authors":"Zihao Lin , Zinan Hong , Zijian Zhou , Miaojing Shi , Jin-Gang Yu , Jingping Yun , Shuangping Huang","doi":"10.1016/j.eswa.2025.129884","DOIUrl":null,"url":null,"abstract":"<div><div>Radiology report generation (RRG) aims to automatically generate clinical reports from chest Xray images, offering potential benefits in aiding diagnosis and reducing clinician workload. However, this task remains challenging due to the need for generating diagnostically accurate and comprehensive reports. Previous methods neglect explicit and fine-grained diagnosis guidance, which is critical for addressing these challenges. To this end, we introduce a novel Diagnosis Centered method for Radiology Report Generation (DC-RRG), explicitly incorporating diagnostic observations to guide report generation. Specifically, our method utilizes a cascaded inference framework, initially using a large language model to generate diagnoses, which subsequently guide the generation of comprehensive reports. We design multi-modal prompts (<em>i.e.</em>, textual instructions, visual features, and medical knowledge features) to extend the Multi-Modal Large Language Model (MLLM) to generate accurate and comprehensive diagnostic reports. Additionally, we also develop a progressive training strategy that aligns modalities in two stages: first using reports for coarse clinical priors, then using diagnoses for fine-grained details. Extensive experiments on MIMIC-CXR and IU-Xray datasets demonstrate that DC-RRG surpasses all previous methods and achieves new state-of-the-art results.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"299 ","pages":"Article 129884"},"PeriodicalIF":7.5000,"publicationDate":"2025-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"DC-RRG: Diagnosis-centered cascaded radiology report generation\",\"authors\":\"Zihao Lin , Zinan Hong , Zijian Zhou , Miaojing Shi , Jin-Gang Yu , Jingping Yun , Shuangping Huang\",\"doi\":\"10.1016/j.eswa.2025.129884\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Radiology report generation (RRG) aims to automatically generate clinical reports from chest Xray images, offering potential benefits in aiding diagnosis and reducing clinician workload. However, this task remains challenging due to the need for generating diagnostically accurate and comprehensive reports. Previous methods neglect explicit and fine-grained diagnosis guidance, which is critical for addressing these challenges. To this end, we introduce a novel Diagnosis Centered method for Radiology Report Generation (DC-RRG), explicitly incorporating diagnostic observations to guide report generation. Specifically, our method utilizes a cascaded inference framework, initially using a large language model to generate diagnoses, which subsequently guide the generation of comprehensive reports. We design multi-modal prompts (<em>i.e.</em>, textual instructions, visual features, and medical knowledge features) to extend the Multi-Modal Large Language Model (MLLM) to generate accurate and comprehensive diagnostic reports. Additionally, we also develop a progressive training strategy that aligns modalities in two stages: first using reports for coarse clinical priors, then using diagnoses for fine-grained details. Extensive experiments on MIMIC-CXR and IU-Xray datasets demonstrate that DC-RRG surpasses all previous methods and achieves new state-of-the-art results.</div></div>\",\"PeriodicalId\":50461,\"journal\":{\"name\":\"Expert Systems with Applications\",\"volume\":\"299 \",\"pages\":\"Article 129884\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-10-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Expert Systems with Applications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0957417425034992\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425034992","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
摘要
放射学报告生成(RRG)旨在从胸部x光图像自动生成临床报告,在辅助诊断和减少临床医生工作量方面提供潜在的好处。然而,由于需要生成准确和全面的诊断报告,这项任务仍然具有挑战性。以前的方法忽略了明确和细粒度的诊断指导,这对于解决这些挑战至关重要。为此,我们介绍了一种新的以诊断为中心的放射学报告生成方法(DC-RRG),明确地结合诊断观察来指导报告生成。具体来说,我们的方法利用级联推理框架,最初使用大型语言模型来生成诊断,随后指导生成综合报告。我们设计了多模态提示(即文本说明、视觉特征和医学知识特征)来扩展多模态大语言模型(multi-modal Large Language Model, MLLM),以生成准确和全面的诊断报告。此外,我们还开发了一种渐进式培训策略,将模式分为两个阶段:首先使用粗临床先验报告,然后使用细粒度细节诊断。在MIMIC-CXR和IU-Xray数据集上进行的大量实验表明,DC-RRG超越了所有以前的方法,并获得了新的最先进的结果。
Radiology report generation (RRG) aims to automatically generate clinical reports from chest Xray images, offering potential benefits in aiding diagnosis and reducing clinician workload. However, this task remains challenging due to the need for generating diagnostically accurate and comprehensive reports. Previous methods neglect explicit and fine-grained diagnosis guidance, which is critical for addressing these challenges. To this end, we introduce a novel Diagnosis Centered method for Radiology Report Generation (DC-RRG), explicitly incorporating diagnostic observations to guide report generation. Specifically, our method utilizes a cascaded inference framework, initially using a large language model to generate diagnoses, which subsequently guide the generation of comprehensive reports. We design multi-modal prompts (i.e., textual instructions, visual features, and medical knowledge features) to extend the Multi-Modal Large Language Model (MLLM) to generate accurate and comprehensive diagnostic reports. Additionally, we also develop a progressive training strategy that aligns modalities in two stages: first using reports for coarse clinical priors, then using diagnoses for fine-grained details. Extensive experiments on MIMIC-CXR and IU-Xray datasets demonstrate that DC-RRG surpasses all previous methods and achieves new state-of-the-art results.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.