MAGDA: Multi-agent guideline-driven diagnostic assistance

David Bani-Harouni, Nassir Navab, Matthias Keicher
arXiv - CS - Artificial Intelligence, published 2024-09-10
DOI: arxiv-2409.06351, https://doi.org/arxiv-2409.06351

Abstract

In emergency departments, rural hospitals, or clinics in less developed regions, clinicians often lack fast image analysis by trained radiologists, which can have a detrimental effect on patients' healthcare. Large Language Models (LLMs) have the potential to alleviate some pressure from these clinicians by providing insights that can help them in their decision-making. While these LLMs achieve high scores on medical exams, showcasing their great theoretical medical knowledge, they tend not to follow medical guidelines. In this work, we introduce a new approach for zero-shot guideline-driven decision support. We model a system of multiple LLM agents, augmented with a contrastive vision-language model, that collaborate to reach a patient diagnosis. Given simple diagnostic guidelines, the agents synthesize prompts and screen the image for findings following these guidelines. Finally, they provide understandable chain-of-thought reasoning for their diagnosis, which is then self-refined to consider inter-dependencies between diseases. As our method is zero-shot, it is adaptable to settings with rare diseases, where training data is limited but expert-crafted disease descriptions are available. We evaluate our method on two chest X-ray datasets, CheXpert and ChestX-ray 14 Longtail, showcasing performance improvement over existing zero-shot methods and generalizability to rare diseases.
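
The screening step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes that a contrastive vision-language model yields an image embedding and text embeddings in a shared space, and that each guideline finding is scored by comparing a positive and a negative prompt. The names `finding_probability`, `screen_image`, `diagnose`, the fixed prompt templates, and the temperature value are all hypothetical. The abstract's LLM prompt synthesis, chain-of-thought reasoning, and self-refinement are replaced here by a fixed template and a simple mean-threshold rule.

```python
import math
from typing import Callable, Dict, List, Sequence

Vector = Sequence[float]


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity between two vectors (0.0 if either has zero norm)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def finding_probability(image_emb: Vector, pos_emb: Vector, neg_emb: Vector,
                        temperature: float = 0.07) -> float:
    """Softmax over the similarities to a positive and a negative prompt.

    The temperature is an assumed value, not one taken from the paper.
    """
    pos = cosine(image_emb, pos_emb) / temperature
    neg = cosine(image_emb, neg_emb) / temperature
    m = max(pos, neg)  # subtract the max for numerical stability
    e_pos, e_neg = math.exp(pos - m), math.exp(neg - m)
    return e_pos / (e_pos + e_neg)


def screen_image(image_emb: Vector,
                 guidelines: Dict[str, List[str]],
                 embed_text: Callable[[str], Vector]) -> Dict[str, Dict[str, float]]:
    """Score every guideline finding of every disease against the image.

    The fixed templates below stand in for LLM-synthesized prompts.
    """
    report: Dict[str, Dict[str, float]] = {}
    for disease, findings in guidelines.items():
        report[disease] = {
            finding: finding_probability(
                image_emb,
                embed_text(f"There is {finding}."),
                embed_text(f"There is no {finding}."),
            )
            for finding in findings
        }
    return report


def diagnose(report: Dict[str, Dict[str, float]],
             threshold: float = 0.5) -> Dict[str, bool]:
    """Stand-in for the reasoning agent: mean finding probability per disease."""
    return {
        disease: sum(probs.values()) / len(probs) >= threshold
        for disease, probs in report.items()
        if probs
    }


def toy_embed(text: str) -> List[float]:
    """Toy text encoder for demonstration only: negated prompts map to one axis."""
    return [0.0, 1.0] if " no " in text else [1.0, 0.0]


if __name__ == "__main__":
    guidelines = {"cardiomegaly": ["an enlarged cardiac silhouette"]}
    report = screen_image([1.0, 0.0], guidelines, toy_embed)
    print(diagnose(report))
```

In a real system, `toy_embed` and the hand-written image vector would be replaced by the text and image encoders of a pretrained contrastive model, and `diagnose` by an LLM that reasons over the per-finding scores.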