Assessing DeepSeek-R1 for Clinical Decision Support in Multidisciplinary Laboratory Medicine.

IF 2.4 3区 医学 Q2 HEALTH CARE SCIENCES & SERVICES
Journal of Multidisciplinary Healthcare Pub Date : 2025-08-12 eCollection Date: 2025-01-01 DOI:10.2147/JMDH.S538253
Qinpeng Li, Lili Zhan, Xinjian Cai
{"title":"Assessing DeepSeek-R1 for Clinical Decision Support in Multidisciplinary Laboratory Medicine.","authors":"Qinpeng Li, Lili Zhan, Xinjian Cai","doi":"10.2147/JMDH.S538253","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Recent advancements in artificial intelligence (AI), particularly with large language models (LLMs), are transforming healthcare by enhancing diagnostic decision-making and clinical workflows. The application of LLMs like DeepSeek-R1 in clinical laboratory medicine demonstrates potential for improving diagnostic accuracy, supporting decision-making, and optimizing patient care.</p><p><strong>Objective: </strong>This study evaluates the performance of DeepSeek-R1 in analyzing clinical laboratory cases and assisting with medical decision-making. The focus is on assessing its accuracy and completeness in generating diagnostic hypotheses, differential diagnoses, and diagnostic workups across diverse clinical cases.</p><p><strong>Methods: </strong>We analyzed 100 clinical cases from <i>Clinical Laboratory Medicine Case Studies</i>, which includes comprehensive case histories and laboratory findings. DeepSeek-R1 was queried independently for each case three times, with three specific questions regarding diagnosis, differential diagnoses, and diagnostic tests. The outputs were assessed for accuracy and completeness by senior clinical laboratory physicians.</p><p><strong>Results: </strong>DeepSeek-R1 achieved an overall accuracy of 72.9% (95% CI [69.9%, 75.7%]) and completeness of 73.4% (95% CI [70.5%, 76.2%]). Performance varied by question type: the highest accuracy was observed for diagnostic hypotheses (85.7%, 95% CI [81.2%, 89.2%]) and the lowest for differential diagnoses (55.0%, 95% CI [49.3%, 60.5%]). Notable variations in performance were also seen across disease categories, with the best performance observed in genetic and obstetric diagnostics (accuracy 93.1%, 95% CI [84.0%, 97.3%]; completeness 86.1%, 95% CI [76.4%, 92.3%]).</p><p><strong>Conclusion: </strong>DeepSeek-R1 demonstrates potential for a decision-support tool in clinical laboratory medicine, particularly in generating diagnostic hypotheses and recommending diagnostic workups. However, its performance in differential diagnosis and handling specific clinical nuances remains limited. Future work should focus on expanding training data, integrating clinical ontologies, and incorporating physician feedback to improve real-world applicability. DeepSeek-R1 and the new versions under development may be promising tools for non-medical professionals and professionals in medical laboratory diagnoses.</p>","PeriodicalId":16357,"journal":{"name":"Journal of Multidisciplinary Healthcare","volume":"18 ","pages":"4979-4988"},"PeriodicalIF":2.4000,"publicationDate":"2025-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12357597/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Multidisciplinary Healthcare","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2147/JMDH.S538253","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Recent advancements in artificial intelligence (AI), particularly with large language models (LLMs), are transforming healthcare by enhancing diagnostic decision-making and clinical workflows. The application of LLMs like DeepSeek-R1 in clinical laboratory medicine demonstrates potential for improving diagnostic accuracy, supporting decision-making, and optimizing patient care.

Objective: This study evaluates the performance of DeepSeek-R1 in analyzing clinical laboratory cases and assisting with medical decision-making. The focus is on assessing its accuracy and completeness in generating diagnostic hypotheses, differential diagnoses, and diagnostic workups across diverse clinical cases.

Methods: We analyzed 100 clinical cases from Clinical Laboratory Medicine Case Studies, which includes comprehensive case histories and laboratory findings. DeepSeek-R1 was queried independently for each case three times, with three specific questions regarding diagnosis, differential diagnoses, and diagnostic tests. The outputs were assessed for accuracy and completeness by senior clinical laboratory physicians.

Results: DeepSeek-R1 achieved an overall accuracy of 72.9% (95% CI [69.9%, 75.7%]) and completeness of 73.4% (95% CI [70.5%, 76.2%]). Performance varied by question type: the highest accuracy was observed for diagnostic hypotheses (85.7%, 95% CI [81.2%, 89.2%]) and the lowest for differential diagnoses (55.0%, 95% CI [49.3%, 60.5%]). Notable variations in performance were also seen across disease categories, with the best performance observed in genetic and obstetric diagnostics (accuracy 93.1%, 95% CI [84.0%, 97.3%]; completeness 86.1%, 95% CI [76.4%, 92.3%]).

Conclusion: DeepSeek-R1 demonstrates potential for a decision-support tool in clinical laboratory medicine, particularly in generating diagnostic hypotheses and recommending diagnostic workups. However, its performance in differential diagnosis and handling specific clinical nuances remains limited. Future work should focus on expanding training data, integrating clinical ontologies, and incorporating physician feedback to improve real-world applicability. DeepSeek-R1 and the new versions under development may be promising tools for non-medical professionals and professionals in medical laboratory diagnoses.

评估DeepSeek-R1在多学科检验医学中的临床决策支持。
背景:人工智能(AI)的最新进展,特别是大型语言模型(llm),正在通过增强诊断决策和临床工作流程来改变医疗保健。像DeepSeek-R1这样的llm在临床检验医学中的应用显示了提高诊断准确性、支持决策和优化患者护理的潜力。目的:评价DeepSeek-R1在分析临床实验室病例和辅助医疗决策方面的性能。重点是评估其准确性和完整性在产生诊断假设,鉴别诊断,和诊断工作跨越不同的临床病例。方法:对《临床检验医学病例研究》中的100例临床病例进行分析,包括全面的病例史和实验室结果。DeepSeek-R1对每个病例进行了三次独立查询,包括关于诊断、鉴别诊断和诊断测试的三个具体问题。结果由高级临床实验室医师评估其准确性和完整性。结果:DeepSeek-R1的总体准确率为72.9% (95% CI[69.9%, 75.7%]),完整性为73.4% (95% CI[70.5%, 76.2%])。表现因问题类型而异:诊断假设的准确率最高(85.7%,95% CI[81.2%, 89.2%]),鉴别诊断的准确率最低(55.0%,95% CI[49.3%, 60.5%])。不同疾病类别的表现也存在显著差异,其中遗传和产科诊断表现最佳(准确性93.1%,95% CI[84.0%, 97.3%];完整性86.1%,95% CI[76.4%, 92.3%])。结论:DeepSeek-R1显示了在临床检验医学中决策支持工具的潜力,特别是在产生诊断假设和推荐诊断检查方面。然而,它在鉴别诊断和处理特定临床细微差别方面的表现仍然有限。未来的工作应侧重于扩展训练数据,整合临床本体,并纳入医生反馈以提高现实世界的适用性。DeepSeek-R1和正在开发的新版本可能是非医学专业人员和医学实验室诊断专业人员的有前途的工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Multidisciplinary Healthcare
Journal of Multidisciplinary Healthcare Nursing-General Nursing
CiteScore
4.60
自引率
3.00%
发文量
287
审稿时长
16 weeks
期刊介绍: The Journal of Multidisciplinary Healthcare (JMDH) aims to represent and publish research in healthcare areas delivered by practitioners of different disciplines. This includes studies and reviews conducted by multidisciplinary teams as well as research which evaluates or reports the results or conduct of such teams or healthcare processes in general. The journal covers a very wide range of areas and we welcome submissions from practitioners at all levels and from all over the world. Good healthcare is not bounded by person, place or time and the journal aims to reflect this. The JMDH is published as an open-access journal to allow this wide range of practical, patient relevant research to be immediately available to practitioners who can access and use it immediately upon publication.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信