{"title":"从指南到代码:使用llm和RAG正式确定临床决策支持的STOPP/START标准。","authors":"Samya Adrouji, Abdelmalek Mouazer, Jean-Baptise Lamy","doi":"10.3233/SHTI251492","DOIUrl":null,"url":null,"abstract":"<p><p>STOPP/START v3 is a set of criteria for optimizing therapy for elderly patients with polypharmacy. Implementing these criteria in prescribing software requires to formalize them, which is a difficult task. This project aimed to automate the formalization of these criteria using large language models (LLMs), specifically leveraging Retrieval-Augmented Generation (RAG) for enhanced accuracy. We employed DeepSeek and GPT-4o-mini for entity extraction, code mapping to ICD-10, LOINC, and ATC, and the generation of executable Python code. A preliminary evaluation conducted on a subset of rules yielded a notably high F1-score (0.90, 0.92, 1 for drug, disease and observation entity mapping respectively and perfect results for medical entity extraction and code logic consistency). These results confirm the model's effectiveness in accurately transforming complex clinical rules into executable code. In conclusion, we successfully automated the creation of executable code from medical guidelines, proving that LLMs, supported by RAG, can be effective for automating clinical decision support tasks and formalizing medical rules.</p>","PeriodicalId":94357,"journal":{"name":"Studies in health technology and informatics","volume":"332 ","pages":"42-46"},"PeriodicalIF":0.0000,"publicationDate":"2025-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"From Guidelines to Code: Formalizing STOPP/START Criteria Using LLMs and RAG for Clinical Decision Support.\",\"authors\":\"Samya Adrouji, Abdelmalek Mouazer, Jean-Baptise Lamy\",\"doi\":\"10.3233/SHTI251492\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>STOPP/START v3 is a set of criteria for optimizing therapy for elderly patients with polypharmacy. Implementing these criteria in prescribing software requires to formalize them, which is a difficult task. This project aimed to automate the formalization of these criteria using large language models (LLMs), specifically leveraging Retrieval-Augmented Generation (RAG) for enhanced accuracy. We employed DeepSeek and GPT-4o-mini for entity extraction, code mapping to ICD-10, LOINC, and ATC, and the generation of executable Python code. A preliminary evaluation conducted on a subset of rules yielded a notably high F1-score (0.90, 0.92, 1 for drug, disease and observation entity mapping respectively and perfect results for medical entity extraction and code logic consistency). These results confirm the model's effectiveness in accurately transforming complex clinical rules into executable code. In conclusion, we successfully automated the creation of executable code from medical guidelines, proving that LLMs, supported by RAG, can be effective for automating clinical decision support tasks and formalizing medical rules.</p>\",\"PeriodicalId\":94357,\"journal\":{\"name\":\"Studies in health technology and informatics\",\"volume\":\"332 \",\"pages\":\"42-46\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-10-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Studies in health technology and informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3233/SHTI251492\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Studies in health technology and informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3233/SHTI251492","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
From Guidelines to Code: Formalizing STOPP/START Criteria Using LLMs and RAG for Clinical Decision Support.
STOPP/START v3 is a set of criteria for optimizing therapy for elderly patients with polypharmacy. Implementing these criteria in prescribing software requires to formalize them, which is a difficult task. This project aimed to automate the formalization of these criteria using large language models (LLMs), specifically leveraging Retrieval-Augmented Generation (RAG) for enhanced accuracy. We employed DeepSeek and GPT-4o-mini for entity extraction, code mapping to ICD-10, LOINC, and ATC, and the generation of executable Python code. A preliminary evaluation conducted on a subset of rules yielded a notably high F1-score (0.90, 0.92, 1 for drug, disease and observation entity mapping respectively and perfect results for medical entity extraction and code logic consistency). These results confirm the model's effectiveness in accurately transforming complex clinical rules into executable code. In conclusion, we successfully automated the creation of executable code from medical guidelines, proving that LLMs, supported by RAG, can be effective for automating clinical decision support tasks and formalizing medical rules.