释放弱标记数据的潜力:异常检测和报告生成的协同进化学习框架

Jinghan Sun;Dong Wei;Zhe Xu;Donghuan Lu;Hong Liu;Hong Wang;Sotirios A. Tsaftaris;Steven McDonagh;Yefeng Zheng;Liansheng Wang
{"title":"释放弱标记数据的潜力:异常检测和报告生成的协同进化学习框架","authors":"Jinghan Sun;Dong Wei;Zhe Xu;Donghuan Lu;Hong Liu;Hong Wang;Sotirios A. Tsaftaris;Steven McDonagh;Yefeng Zheng;Liansheng Wang","doi":"10.1109/TMI.2024.3516954","DOIUrl":null,"url":null,"abstract":"Anatomical abnormality detection and report generation of chest X-ray (CXR) are two essential tasks in clinical practice. The former aims at localizing and characterizing cardiopulmonary radiological findings in CXRs, while the latter summarizes the findings in a detailed report for further diagnosis and treatment. Existing methods often focused on either task separately, ignoring their correlation. This work proposes a co-evolutionary abnormality detection and report generation (CoE-DG) framework. The framework utilizes both fully labeled (with bounding box annotations and clinical reports) and weakly labeled (with reports only) data to achieve mutual promotion between the abnormality detection and report generation tasks. Specifically, we introduce a bi-directional information interaction strategy with generator-guided information propagation (GIP) and detector-guided information propagation (DIP). For semi-supervised abnormality detection, GIP takes the informative feature extracted by the generator as an auxiliary input to the detector and uses the generator’s prediction to refine the detector’s pseudo labels. We further propose an intra-image-modal self-adaptive non-maximum suppression module (SA-NMS). This module dynamically rectifies pseudo detection labels generated by the teacher detection model with high-confidence predictions by the student. Inversely, for report generation, DIP takes the abnormalities’ categories and locations predicted by the detector as input and guidance for the generator to improve the generated reports. Finally, a co-evolutionary training strategy is implemented to iteratively conduct GIP and DIP and consistently improve both tasks’ performance. Experimental results on two public CXR datasets demonstrate CoE-DG’s superior performance to several up-to-date object detection, report generation, and unified models. Our code is available at <uri>https://github.com/jinghanSunn/CoE-DG</uri>.","PeriodicalId":94033,"journal":{"name":"IEEE transactions on medical imaging","volume":"44 4","pages":"1671-1685"},"PeriodicalIF":0.0000,"publicationDate":"2024-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Unlocking the Potential of Weakly Labeled Data: A Co-Evolutionary Learning Framework for Abnormality Detection and Report Generation\",\"authors\":\"Jinghan Sun;Dong Wei;Zhe Xu;Donghuan Lu;Hong Liu;Hong Wang;Sotirios A. Tsaftaris;Steven McDonagh;Yefeng Zheng;Liansheng Wang\",\"doi\":\"10.1109/TMI.2024.3516954\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Anatomical abnormality detection and report generation of chest X-ray (CXR) are two essential tasks in clinical practice. The former aims at localizing and characterizing cardiopulmonary radiological findings in CXRs, while the latter summarizes the findings in a detailed report for further diagnosis and treatment. Existing methods often focused on either task separately, ignoring their correlation. This work proposes a co-evolutionary abnormality detection and report generation (CoE-DG) framework. The framework utilizes both fully labeled (with bounding box annotations and clinical reports) and weakly labeled (with reports only) data to achieve mutual promotion between the abnormality detection and report generation tasks. Specifically, we introduce a bi-directional information interaction strategy with generator-guided information propagation (GIP) and detector-guided information propagation (DIP). For semi-supervised abnormality detection, GIP takes the informative feature extracted by the generator as an auxiliary input to the detector and uses the generator’s prediction to refine the detector’s pseudo labels. We further propose an intra-image-modal self-adaptive non-maximum suppression module (SA-NMS). This module dynamically rectifies pseudo detection labels generated by the teacher detection model with high-confidence predictions by the student. Inversely, for report generation, DIP takes the abnormalities’ categories and locations predicted by the detector as input and guidance for the generator to improve the generated reports. Finally, a co-evolutionary training strategy is implemented to iteratively conduct GIP and DIP and consistently improve both tasks’ performance. Experimental results on two public CXR datasets demonstrate CoE-DG’s superior performance to several up-to-date object detection, report generation, and unified models. Our code is available at <uri>https://github.com/jinghanSunn/CoE-DG</uri>.\",\"PeriodicalId\":94033,\"journal\":{\"name\":\"IEEE transactions on medical imaging\",\"volume\":\"44 4\",\"pages\":\"1671-1685\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-12-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE transactions on medical imaging\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10798513/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on medical imaging","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10798513/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

胸部x线解剖异常检测和报告生成是临床工作中的两项重要任务。前者旨在定位和描述cxr中的心肺放射学发现,而后者则在详细报告中总结发现,以供进一步诊断和治疗。现有的方法往往分别关注这两个任务,而忽略了它们之间的相关性。这项工作提出了一个共同进化异常检测和报告生成(CoE-DG)框架。该框架利用全标注(带边界框注释和临床报告)和弱标注(仅带报告)数据,实现异常检测和报告生成任务之间的相互促进。具体来说,我们引入了一种发生器引导信息传播(GIP)和探测器引导信息传播(DIP)的双向信息交互策略。对于半监督异常检测,GIP将生成器提取的信息特征作为检测器的辅助输入,并使用生成器的预测来细化检测器的伪标签。我们进一步提出了一种图像模态内自适应非最大抑制模块(SA-NMS)。该模块通过学生的高置信度预测动态校正教师检测模型生成的伪检测标签。相反,对于报告生成,DIP将检测器预测的异常类别和位置作为生成器的输入和指导,以改进生成的报告。最后,实现了一种协同进化训练策略,迭代地执行GIP和DIP,并持续地提高这两个任务的性能。在两个公共CXR数据集上的实验结果表明,CoE-DG在几种最新的目标检测、报告生成和统一模型方面具有优越的性能。我们的代码可在https://github.com/jinghanSunn/CoE-DG上获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Unlocking the Potential of Weakly Labeled Data: A Co-Evolutionary Learning Framework for Abnormality Detection and Report Generation
Anatomical abnormality detection and report generation of chest X-ray (CXR) are two essential tasks in clinical practice. The former aims at localizing and characterizing cardiopulmonary radiological findings in CXRs, while the latter summarizes the findings in a detailed report for further diagnosis and treatment. Existing methods often focused on either task separately, ignoring their correlation. This work proposes a co-evolutionary abnormality detection and report generation (CoE-DG) framework. The framework utilizes both fully labeled (with bounding box annotations and clinical reports) and weakly labeled (with reports only) data to achieve mutual promotion between the abnormality detection and report generation tasks. Specifically, we introduce a bi-directional information interaction strategy with generator-guided information propagation (GIP) and detector-guided information propagation (DIP). For semi-supervised abnormality detection, GIP takes the informative feature extracted by the generator as an auxiliary input to the detector and uses the generator’s prediction to refine the detector’s pseudo labels. We further propose an intra-image-modal self-adaptive non-maximum suppression module (SA-NMS). This module dynamically rectifies pseudo detection labels generated by the teacher detection model with high-confidence predictions by the student. Inversely, for report generation, DIP takes the abnormalities’ categories and locations predicted by the detector as input and guidance for the generator to improve the generated reports. Finally, a co-evolutionary training strategy is implemented to iteratively conduct GIP and DIP and consistently improve both tasks’ performance. Experimental results on two public CXR datasets demonstrate CoE-DG’s superior performance to several up-to-date object detection, report generation, and unified models. Our code is available at https://github.com/jinghanSunn/CoE-DG.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信