Vishwanatha M Rao, Serena Zhang, Julian N Acosta, Subathra Adithan, Pranav Rajpurkar
{"title":"ReXErr: Synthesizing Clinically Meaningful Errors in Diagnostic Radiology Reports.","authors":"Vishwanatha M Rao, Serena Zhang, Julian N Acosta, Subathra Adithan, Pranav Rajpurkar","doi":"10.1142/9789819807024_0006","DOIUrl":null,"url":null,"abstract":"<p><p>Accurately interpreting medical images and writing radiology reports is a critical but challenging task in healthcare. Both human-written and AI-generated reports can contain errors, ranging from clinical inaccuracies to linguistic mistakes. To address this, we introduce ReXErr, a methodology that leverages Large Language Models to generate representative errors within chest X-ray reports. Working with board-certified radiologists, we developed error categories that capture common mistakes in both human and AI-generated reports. Our approach uses a novel sampling scheme to inject diverse errors while maintaining clinical plausibility. ReXErr demonstrates consistency across error categories and produces errors that closely mimic those found in real-world scenarios. This method has the potential to aid in the development and evaluation of report correction algorithms, potentially enhancing the quality and reliability of radiology reporting.</p>","PeriodicalId":34954,"journal":{"name":"Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing","volume":"30 ","pages":"70-81"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1142/9789819807024_0006","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Computer Science","Score":null,"Total":0}
引用次数: 0
Abstract
Accurately interpreting medical images and writing radiology reports is a critical but challenging task in healthcare. Both human-written and AI-generated reports can contain errors, ranging from clinical inaccuracies to linguistic mistakes. To address this, we introduce ReXErr, a methodology that leverages Large Language Models to generate representative errors within chest X-ray reports. Working with board-certified radiologists, we developed error categories that capture common mistakes in both human and AI-generated reports. Our approach uses a novel sampling scheme to inject diverse errors while maintaining clinical plausibility. ReXErr demonstrates consistency across error categories and produces errors that closely mimic those found in real-world scenarios. This method has the potential to aid in the development and evaluation of report correction algorithms, potentially enhancing the quality and reliability of radiology reporting.
准确解读医学影像和撰写放射学报告是医疗保健领域一项至关重要但又极具挑战性的任务。人工撰写的报告和人工智能生成的报告都可能包含错误,从临床不准确到语言错误不等。为了解决这个问题,我们引入了 ReXErr,这是一种利用大型语言模型生成胸部 X 光报告中代表性错误的方法。我们与获得认证的放射科医生合作,开发了错误类别,可捕捉人类和人工智能生成的报告中的常见错误。我们的方法采用了一种新颖的抽样方案,在保持临床合理性的同时注入各种错误。ReXErr 在不同的错误类别中表现出一致性,所产生的错误与真实世界中发现的错误非常相似。这种方法有望帮助开发和评估报告更正算法,从而提高放射学报告的质量和可靠性。