Yingying Huang , Yang Si , Bingliang Hu , Jiang Shen , Linshen Xie , Dongsheng Wu , Quan Wang
{"title":"Retrieval-based adaptive fusion strategy for medical report generation","authors":"Yingying Huang , Yang Si , Bingliang Hu , Jiang Shen , Linshen Xie , Dongsheng Wu , Quan Wang","doi":"10.1016/j.cmpb.2025.108907","DOIUrl":null,"url":null,"abstract":"<div><h3>Background and Objective:</h3><div>Retrieval-based medical report generation methods attempt to improve efficiency by reusing historical reports, but their fixed feature-concatenation strategies often introduce cross-case redundancy. Moreover, most methods are designed for low-resolution X-ray images, and their evaluation metrics rely on textual similarity and overlook the implications of misdescription. To address these, we first constructed a high-resolution CT-report dataset, comprising 9 categories of chest CT scans and corresponding reports from 505 patients. Then, we propose RAFS, a retrieval-based adaptive fusion strategy, to dynamically balance contributions from generation and retrieval modules. Finally, we propose DICE, a dual-perspective integrated clinical evaluation including consensus-based positive scoring and penalties of misdescription.</div></div><div><h3>Methods:</h3><div>RAFS integrates an attention module to calculate the similarity between the current generated word’s hidden state and the retrieved text, passing the result through a fully connected layer to obtain retrieval probabilities. After, obtained attention weights are feed the Sigmoid function and its result for fusing the generation probabilities and retrieval probabilities.</div></div><div><h3>Results:</h3><div>RAFS achieves superior performance with BLEU-4, METEOR, ROUGE_L, CIDEr and the average of DICE scores of 45.8, 32.9, 59.1, 79.3 and 64.6 in the CT report generation task, outperforming existing methods. methods.</div></div><div><h3>Conclusion:</h3><div>RAFS significantly enhances the clinical interpretability of generated reports, with future work dedicated to optimizing the characterization of local pathological lesions.</div></div>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"270 ","pages":"Article 108907"},"PeriodicalIF":4.9000,"publicationDate":"2025-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer methods and programs in biomedicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169260725003244","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Background and Objective:
Retrieval-based medical report generation methods attempt to improve efficiency by reusing historical reports, but their fixed feature-concatenation strategies often introduce cross-case redundancy. Moreover, most methods are designed for low-resolution X-ray images, and their evaluation metrics rely on textual similarity and overlook the implications of misdescription. To address these, we first constructed a high-resolution CT-report dataset, comprising 9 categories of chest CT scans and corresponding reports from 505 patients. Then, we propose RAFS, a retrieval-based adaptive fusion strategy, to dynamically balance contributions from generation and retrieval modules. Finally, we propose DICE, a dual-perspective integrated clinical evaluation including consensus-based positive scoring and penalties of misdescription.
Methods:
RAFS integrates an attention module to calculate the similarity between the current generated word’s hidden state and the retrieved text, passing the result through a fully connected layer to obtain retrieval probabilities. After, obtained attention weights are feed the Sigmoid function and its result for fusing the generation probabilities and retrieval probabilities.
Results:
RAFS achieves superior performance with BLEU-4, METEOR, ROUGE_L, CIDEr and the average of DICE scores of 45.8, 32.9, 59.1, 79.3 and 64.6 in the CT report generation task, outperforming existing methods. methods.
Conclusion:
RAFS significantly enhances the clinical interpretability of generated reports, with future work dedicated to optimizing the characterization of local pathological lesions.
期刊介绍:
To encourage the development of formal computing methods, and their application in biomedical research and medical practice, by illustration of fundamental principles in biomedical informatics research; to stimulate basic research into application software design; to report the state of research of biomedical information processing projects; to report new computer methodologies applied in biomedical areas; the eventual distribution of demonstrable software to avoid duplication of effort; to provide a forum for discussion and improvement of existing software; to optimize contact between national organizations and regional user groups by promoting an international exchange of information on formal methods, standards and software in biomedicine.
Computer Methods and Programs in Biomedicine covers computing methodology and software systems derived from computing science for implementation in all aspects of biomedical research and medical practice. It is designed to serve: biochemists; biologists; geneticists; immunologists; neuroscientists; pharmacologists; toxicologists; clinicians; epidemiologists; psychiatrists; psychologists; cardiologists; chemists; (radio)physicists; computer scientists; programmers and systems analysts; biomedical, clinical, electrical and other engineers; teachers of medical informatics and users of educational software.