从指南到代码：使用llm和RAG正式确定临床决策支持的STOPP/START标准。

Studies in health technology and informatics Pub Date : 2025-10-02 DOI:10.3233/SHTI251492

Samya Adrouji, Abdelmalek Mouazer, Jean-Baptise Lamy

{"title":"从指南到代码：使用llm和RAG正式确定临床决策支持的STOPP/START标准。","authors":"Samya Adrouji, Abdelmalek Mouazer, Jean-Baptise Lamy","doi":"10.3233/SHTI251492","DOIUrl":null,"url":null,"abstract":"STOPP/START v3 is a set of criteria for optimizing therapy for elderly patients with polypharmacy. Implementing these criteria in prescribing software requires to formalize them, which is a difficult task. This project aimed to automate the formalization of these criteria using large language models (LLMs), specifically leveraging Retrieval-Augmented Generation (RAG) for enhanced accuracy. We employed DeepSeek and GPT-4o-mini for entity extraction, code mapping to ICD-10, LOINC, and ATC, and the generation of executable Python code. A preliminary evaluation conducted on a subset of rules yielded a notably high F1-score (0.90, 0.92, 1 for drug, disease and observation entity mapping respectively and perfect results for medical entity extraction and code logic consistency). These results confirm the model's effectiveness in accurately transforming complex clinical rules into executable code. In conclusion, we successfully automated the creation of executable code from medical guidelines, proving that LLMs, supported by RAG, can be effective for automating clinical decision support tasks and formalizing medical rules.","PeriodicalId":94357,"journal":{"name":"Studies in health technology and informatics","volume":"332 ","pages":"42-46"},"PeriodicalIF":0.0000,"publicationDate":"2025-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"From Guidelines to Code: Formalizing STOPP/START Criteria Using LLMs and RAG for Clinical Decision Support.\",\"authors\":\"Samya Adrouji, Abdelmalek Mouazer, Jean-Baptise Lamy\",\"doi\":\"10.3233/SHTI251492\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"STOPP/START v3 is a set of criteria for optimizing therapy for elderly patients with polypharmacy. Implementing these criteria in prescribing software requires to formalize them, which is a difficult task. This project aimed to automate the formalization of these criteria using large language models (LLMs), specifically leveraging Retrieval-Augmented Generation (RAG) for enhanced accuracy. We employed DeepSeek and GPT-4o-mini for entity extraction, code mapping to ICD-10, LOINC, and ATC, and the generation of executable Python code. A preliminary evaluation conducted on a subset of rules yielded a notably high F1-score (0.90, 0.92, 1 for drug, disease and observation entity mapping respectively and perfect results for medical entity extraction and code logic consistency). These results confirm the model's effectiveness in accurately transforming complex clinical rules into executable code. In conclusion, we successfully automated the creation of executable code from medical guidelines, proving that LLMs, supported by RAG, can be effective for automating clinical decision support tasks and formalizing medical rules.\",\"PeriodicalId\":94357,\"journal\":{\"name\":\"Studies in health technology and informatics\",\"volume\":\"332 \",\"pages\":\"42-46\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-10-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Studies in health technology and informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3233/SHTI251492\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Studies in health technology and informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3233/SHTI251492","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

STOPP/START v3是一套优化老年多药患者治疗的标准。在规定软件时实现这些标准需要将它们形式化，这是一项困难的任务。该项目旨在使用大型语言模型（llm）自动形式化这些标准，特别是利用检索增强生成（RAG）来提高准确性。我们使用DeepSeek和gpt - 40 -mini进行实体提取，代码映射到ICD-10， LOINC和ATC，并生成可执行的Python代码。对规则子集进行初步评价，获得了非常高的f1分（药物、疾病和观察实体映射分别为0.90、0.92和1，医疗实体提取和代码逻辑一致性取得了很好的结果）。这些结果证实了该模型在将复杂的临床规则准确转换为可执行代码方面的有效性。总之，我们成功地自动化了医疗指南中可执行代码的创建，证明了由RAG支持的llm可以有效地自动化临床决策支持任务和形式化医疗规则。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

From Guidelines to Code: Formalizing STOPP/START Criteria Using LLMs and RAG for Clinical Decision Support.

STOPP/START v3 is a set of criteria for optimizing therapy for elderly patients with polypharmacy. Implementing these criteria in prescribing software requires to formalize them, which is a difficult task. This project aimed to automate the formalization of these criteria using large language models (LLMs), specifically leveraging Retrieval-Augmented Generation (RAG) for enhanced accuracy. We employed DeepSeek and GPT-4o-mini for entity extraction, code mapping to ICD-10, LOINC, and ATC, and the generation of executable Python code. A preliminary evaluation conducted on a subset of rules yielded a notably high F1-score (0.90, 0.92, 1 for drug, disease and observation entity mapping respectively and perfect results for medical entity extraction and code logic consistency). These results confirm the model's effectiveness in accurately transforming complex clinical rules into executable code. In conclusion, we successfully automated the creation of executable code from medical guidelines, proving that LLMs, supported by RAG, can be effective for automating clinical decision support tasks and formalizing medical rules.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Studies in health technology and informatics

自引率

0.00%

发文量