从精神分裂症谱系和情绪障碍患者的药物性帕金森病病例报告中验证大型语言模型与人工信息提取：概念证明研究。

IF 4.1 Q2 PSYCHIATRY

Schizophrenia (Heidelberg, Germany) Pub Date : 2025-03-20 DOI:10.1038/s41537-025-00601-5

Sebastian Volkmer, Alina Glück, Andreas Meyer-Lindenberg, Emanuel Schwarz, Dusan Hirjak

{"title":"从精神分裂症谱系和情绪障碍患者的药物性帕金森病病例报告中验证大型语言模型与人工信息提取：概念证明研究。","authors":"Sebastian Volkmer, Alina Glück, Andreas Meyer-Lindenberg, Emanuel Schwarz, Dusan Hirjak","doi":"10.1038/s41537-025-00601-5","DOIUrl":null,"url":null,"abstract":"In this proof of concept study, we demonstrated how Large Language Models (LLMs) can automate the conversion of unstructured case reports into clinical ratings. By leveraging instructions from a standardized clinical rating scale and evaluating the LLM's confidence in its outputs, we aimed to refine prompting strategies and enhance reproducibility. Using this strategy and case reports of drug-induced Parkinsonism, we showed that LLM-extracted data closely align with clinical rater manual extraction, achieving an accuracy of 90%.","PeriodicalId":74758,"journal":{"name":"Schizophrenia (Heidelberg, Germany)","volume":"11 1","pages":"47"},"PeriodicalIF":4.1000,"publicationDate":"2025-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11926372/pdf/","citationCount":"0","resultStr":"{\"title\":\"Validating large language models against manual information extraction from case reports of drug-induced parkinsonism in patients with schizophrenia spectrum and mood disorders: a proof of concept study.\",\"authors\":\"Sebastian Volkmer, Alina Glück, Andreas Meyer-Lindenberg, Emanuel Schwarz, Dusan Hirjak\",\"doi\":\"10.1038/s41537-025-00601-5\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this proof of concept study, we demonstrated how Large Language Models (LLMs) can automate the conversion of unstructured case reports into clinical ratings. By leveraging instructions from a standardized clinical rating scale and evaluating the LLM's confidence in its outputs, we aimed to refine prompting strategies and enhance reproducibility. Using this strategy and case reports of drug-induced Parkinsonism, we showed that LLM-extracted data closely align with clinical rater manual extraction, achieving an accuracy of 90%.\",\"PeriodicalId\":74758,\"journal\":{\"name\":\"Schizophrenia (Heidelberg, Germany)\",\"volume\":\"11 1\",\"pages\":\"47\"},\"PeriodicalIF\":4.1000,\"publicationDate\":\"2025-03-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11926372/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Schizophrenia (Heidelberg, Germany)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1038/s41537-025-00601-5\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"PSYCHIATRY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Schizophrenia (Heidelberg, Germany)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1038/s41537-025-00601-5","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PSYCHIATRY","Score":null,"Total":0}

引用次数: 0

摘要

在这个概念验证研究中，我们演示了大型语言模型（llm）如何将非结构化病例报告自动转换为临床评级。通过利用来自标准化临床评定量表的指示并评估LLM对其输出的信心，我们旨在完善提示策略并提高可重复性。利用这一策略和药物性帕金森病的病例报告，我们发现llm提取的数据与临床人工提取的数据密切一致，准确率达到90%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Validating large language models against manual information extraction from case reports of drug-induced parkinsonism in patients with schizophrenia spectrum and mood disorders: a proof of concept study.

查看原文本刊更多论文

In this proof of concept study, we demonstrated how Large Language Models (LLMs) can automate the conversion of unstructured case reports into clinical ratings. By leveraging instructions from a standardized clinical rating scale and evaluating the LLM's confidence in its outputs, we aimed to refine prompting strategies and enhance reproducibility. Using this strategy and case reports of drug-induced Parkinsonism, we showed that LLM-extracted data closely align with clinical rater manual extraction, achieving an accuracy of 90%.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Schizophrenia (Heidelberg, Germany)

自引率

0.00%

发文量