Development of a novel artificial intelligence clinical decision support tool for hand surgery: HandRAG.

IF 0.5 Q4 SURGERY

Journal of Hand and Microsurgery Pub Date : 2025-06-11 eCollection Date: 2025-07-01 DOI:10.1016/j.jham.2025.100293

Berk B Ozmen, Nishant Singh, Kavach Shah, Ibrahim Berber, Damanjit Singh, Eugene Pinsky, Antonio Rampazzo, Graham S Schwarz

{"title":"Development of a novel artificial intelligence clinical decision support tool for hand surgery: HandRAG.","authors":"Berk B Ozmen, Nishant Singh, Kavach Shah, Ibrahim Berber, Damanjit Singh, Eugene Pinsky, Antonio Rampazzo, Graham S Schwarz","doi":"10.1016/j.jham.2025.100293","DOIUrl":null,"url":null,"abstract":"Purpose: Hand surgery decision-making requires integration of complex anatomical understanding, diverse patient-specific factors, and nuanced operative techniques. While artificial intelligence (AI), large language models (LLMs), and retrieval-augmented generation (RAG) models have advanced significantly in various fields, no AI-driven clinical decision support systems currently exist for hand surgery. A novel retrieval-enhanced AI large language model specifically tailored for hand surgery was developed, capable of effectively utilizing peer-reviewed published hand surgery literature for clinical decision support in real-time at point of care.Methods: An AI clinical decision support system was developed integrating all available open-access 4510 peer-reviewed hand surgery publications from 2000 to 2024 identified through hand surgery-relevant keywords. Documents were processed using a hierarchical pipeline based on the RAPTOR methodology, which breaks down large texts into smaller segments to enhance accurate retrieval. The system was evaluated using 15 standardized clinical queries assessed using automated computational metrics for correctness and semantic similarity to source documents.Results: The AI system demonstrated consistent performance with an average G-Eval correctness score of 0.79, SEM with an average similarity score of 0.75 (range: 0.54-0.86) and average maximum similarity score of 0.80 (range: 0.56-0.91), predominantly at moderate confidence levels. Generated recommendations were contextually appropriate and reliably linked to relevant hand surgery literature, providing accurate and clinically meaningful guidance.Conclusion: The AI system, HandRAG, incorporating RAG and LLM approach offers potential benefits for evidence-based clinical decision support and education in hand surgery.","PeriodicalId":45368,"journal":{"name":"Journal of Hand and Microsurgery","volume":"17 4","pages":"100293"},"PeriodicalIF":0.5000,"publicationDate":"2025-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12210289/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Hand and Microsurgery","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.jham.2025.100293","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/7/1 0:00:00","PubModel":"eCollection","JCR":"Q4","JCRName":"SURGERY","Score":null,"Total":0}

引用次数: 0

Abstract

Purpose: Hand surgery decision-making requires integration of complex anatomical understanding, diverse patient-specific factors, and nuanced operative techniques. While artificial intelligence (AI), large language models (LLMs), and retrieval-augmented generation (RAG) models have advanced significantly in various fields, no AI-driven clinical decision support systems currently exist for hand surgery. A novel retrieval-enhanced AI large language model specifically tailored for hand surgery was developed, capable of effectively utilizing peer-reviewed published hand surgery literature for clinical decision support in real-time at point of care.

Methods: An AI clinical decision support system was developed integrating all available open-access 4510 peer-reviewed hand surgery publications from 2000 to 2024 identified through hand surgery-relevant keywords. Documents were processed using a hierarchical pipeline based on the RAPTOR methodology, which breaks down large texts into smaller segments to enhance accurate retrieval. The system was evaluated using 15 standardized clinical queries assessed using automated computational metrics for correctness and semantic similarity to source documents.

Results: The AI system demonstrated consistent performance with an average G-Eval correctness score of 0.79, SEM with an average similarity score of 0.75 (range: 0.54-0.86) and average maximum similarity score of 0.80 (range: 0.56-0.91), predominantly at moderate confidence levels. Generated recommendations were contextually appropriate and reliably linked to relevant hand surgery literature, providing accurate and clinically meaningful guidance.

Conclusion: The AI system, HandRAG, incorporating RAG and LLM approach offers potential benefits for evidence-based clinical decision support and education in hand surgery.

查看原文本刊更多论文

开发一种新型人工智能手外科临床决策支持工具：HandRAG。

目的：手部手术决策需要综合复杂的解剖学知识、不同的患者特异性因素和细致入微的手术技术。虽然人工智能（AI）、大型语言模型（llm）和检索增强生成（RAG）模型在各个领域都取得了重大进展，但目前还没有人工智能驱动的手外科临床决策支持系统。开发了一种专门为手外科量身定制的新型检索增强人工智能大语言模型，能够有效地利用同行评审的已发表手外科文献，在护理点实时为临床决策提供支持。方法：整合2000年至2024年开放获取的4510篇同行评议手外科出版物，通过手外科相关关键词进行检索，构建人工智能临床决策支持系统。使用基于RAPTOR方法的分层管道处理文档，该方法将大文本分解为较小的部分，以提高检索的准确性。系统使用15个标准化临床查询进行评估，使用自动计算指标评估源文档的正确性和语义相似性。结果：人工智能系统表现出一致的性能，平均G-Eval正确性得分为0.79，平均相似度得分为0.75（范围：0.54-0.86），平均最大相似度得分为0.80（范围：0.56-0.91），主要在中等置信水平下。生成的建议在情境上是合适的，并与相关手外科文献可靠地联系在一起，提供准确和有临床意义的指导。结论：人工智能系统HandRAG结合了RAG和LLM方法，为手外科循证临床决策支持和教育提供了潜在的好处。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Hand and Microsurgery SURGERY-

CiteScore

1.00

自引率

25.00%

发文量