Structural chain of thoughts for radiology education

IF 7.6 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Knowledge-Based Systems Pub Date : 2025-09-12 DOI:10.1016/j.knosys.2025.114433

Akash Awasthi , Brandon Chung , Anh Mai Vu , Saba Khan , Ngan Le , Zhigang Deng , Rishi Agrawal , Carol C. Wu , Hien Van Nguyen

{"title":"Structural chain of thoughts for radiology education","authors":"Akash Awasthi , Brandon Chung , Anh Mai Vu , Saba Khan , Ngan Le , Zhigang Deng , Rishi Agrawal , Carol C. Wu , Hien Van Nguyen","doi":"10.1016/j.knosys.2025.114433","DOIUrl":null,"url":null,"abstract":"<div><div>Radiology education requires trainees to develop both perceptual and interpretive expertise. However, refinement of these skills is often impeded by the limited availability of mentorship, a consequence of the demanding schedules of experienced radiologists. This lack of personalized guidance makes it difficult for learners to recognize the mistakes they make, understand why those errors occurred and how to refine their perceptual processes. Many of these errors arise from subtle differences in visual attention, such as failing to fixate on an abnormality, allocating an insufficient fixation time, or overlooking an abnormality despite scanning the correct region. Although Large Language Models (LLMs) and Large Multimodal Models (LMMs) have been explored for radiology tasks, they often struggle to detect such fine-grained multimodal variations, particularly when comparing gaze behavior between experts and trainees. To address these limitations, we introduce Structural Chain of Thoughts (SCoT), a novel framework that enhances LLMs and LMMs sensitivity to nuanced multimodal differences by structuring gaze data and radiology report into a thought graph. By leveraging a structural prior, SCoT systematically identifies key perceptual and interpretive discrepancies, allowing models to provide targeted, context-aware feedback. This structured approach not only highlights missed findings but also explains the reasoning behind perceptual errors, turning them into learning opportunities. Applied within radiology education, SCoT bridges the gap between expert and novice performance, offering a scalable solution for AI-driven diagnostic training. We further contribute a simulated dataset of perceptual errors in chest X-ray (CXR) interpretation, facilitating future research into multimodal reasoning and AI-driven medical education. Unlike conventional Chain-of-Thought approaches, SCoT explicitly integrates gaze and textual information into a structured reasoning process, yielding interpretable, fine-grained, and personalized feedback tailored to the unique needs of radiology training. The code and data will be available here: <span><span>GitHub Repository</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"330 ","pages":"Article 114433"},"PeriodicalIF":7.6000,"publicationDate":"2025-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705125014728","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Radiology education requires trainees to develop both perceptual and interpretive expertise. However, refinement of these skills is often impeded by the limited availability of mentorship, a consequence of the demanding schedules of experienced radiologists. This lack of personalized guidance makes it difficult for learners to recognize the mistakes they make, understand why those errors occurred and how to refine their perceptual processes. Many of these errors arise from subtle differences in visual attention, such as failing to fixate on an abnormality, allocating an insufficient fixation time, or overlooking an abnormality despite scanning the correct region. Although Large Language Models (LLMs) and Large Multimodal Models (LMMs) have been explored for radiology tasks, they often struggle to detect such fine-grained multimodal variations, particularly when comparing gaze behavior between experts and trainees. To address these limitations, we introduce Structural Chain of Thoughts (SCoT), a novel framework that enhances LLMs and LMMs sensitivity to nuanced multimodal differences by structuring gaze data and radiology report into a thought graph. By leveraging a structural prior, SCoT systematically identifies key perceptual and interpretive discrepancies, allowing models to provide targeted, context-aware feedback. This structured approach not only highlights missed findings but also explains the reasoning behind perceptual errors, turning them into learning opportunities. Applied within radiology education, SCoT bridges the gap between expert and novice performance, offering a scalable solution for AI-driven diagnostic training. We further contribute a simulated dataset of perceptual errors in chest X-ray (CXR) interpretation, facilitating future research into multimodal reasoning and AI-driven medical education. Unlike conventional Chain-of-Thought approaches, SCoT explicitly integrates gaze and textual information into a structured reasoning process, yielding interpretable, fine-grained, and personalized feedback tailored to the unique needs of radiology training. The code and data will be available here: GitHub Repository.

查看原文本刊更多论文

放射学教育的思想结构链

放射学教育要求受训者培养感性和解释性的专业知识。然而，由于经验丰富的放射科医生的日程安排，指导的有限可用性往往阻碍了这些技能的改进。由于缺乏个性化的指导，学习者很难认识到他们所犯的错误，理解这些错误发生的原因以及如何改进他们的感知过程。这些错误中有许多是由于视觉注意的细微差异引起的，例如未能注意到异常，分配的注视时间不足，或者尽管扫描了正确的区域，但忽略了异常。尽管大型语言模型（llm）和大型多模态模型（lmm）已经在放射学任务中进行了探索，但它们通常很难检测到这种细粒度的多模态变化，特别是在比较专家和受训者之间的凝视行为时。为了解决这些限制，我们引入了结构思维链（SCoT），这是一个新的框架，通过将凝视数据和放射学报告结构化成一个思维图，提高llm和lmm对细微多模态差异的敏感性。通过利用结构先验，SCoT系统地识别关键的感知和解释差异，允许模型提供有针对性的、上下文感知的反馈。这种结构化的方法不仅突出了遗漏的发现，而且解释了感知错误背后的原因，将它们转化为学习机会。应用于放射学教育，SCoT弥合了专家和新手之间的差距，为人工智能驱动的诊断培训提供了可扩展的解决方案。我们进一步提供了胸部x射线（CXR）解释中感知错误的模拟数据集，促进了未来对多模态推理和人工智能驱动的医学教育的研究。与传统的思维链方法不同，SCoT明确地将凝视和文本信息集成到一个结构化的推理过程中，产生可解释的、细粒度的、个性化的反馈，以满足放射学培训的独特需求。代码和数据可以在这里获得：GitHub Repository。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Knowledge-Based Systems 工程技术-计算机：人工智能

CiteScore

14.80

自引率

12.50%

发文量

1245

审稿时长

7.8 months

期刊介绍： Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.