Ömer Alperen Gürses, Anıl Özüdoğru, Figen Tuncay, Caner Kararti
{"title":"人工智能大语言模型在膝骨关节炎个性化康复方案中的作用:一项观察性研究。","authors":"Ömer Alperen Gürses, Anıl Özüdoğru, Figen Tuncay, Caner Kararti","doi":"10.1007/s10916-025-02207-x","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Large language models (LLMs) can contribute to treatment options and outcomes by assisting physiotherapists for conditions like osteoarthritis.</p><p><strong>Aims: </strong>The objective of this early-stage cross-sectional study is to assess the alignment of large language models with physiotherapists in designing physiotherapy and rehabilitation programs for knee osteoarthritis.</p><p><strong>Methods: </strong>Forty patients diagnosed with knee osteoarthritis were assessed using standardized clinical criteria. For each patient, individualized rehabilitation programs were created by three physiotherapists and by ChatGPT-4o and Gemini Advanced using structured prompts. The presence or absence of 50 clinically relevant rehabilitation parameters was recorded for each program. Chi-square tests were used to evaluate agreement rates between the LLMs and the physiotherapist-generated Consensus programs.</p><p><strong>Results: </strong>ChatGPT-4o achieved a 74% agreement rate with the physiotherapists' Consensus programs, while Gemini Advanced achieved 70%. Although both models showed high compatibility with general rehabilitation components, they demonstrated notable limitations in exercise specificity, including frequency, sets, and progression criteria. ChatGPT-4o performed as well as or better than Gemini in most phases, particularly in Phase 3, while Gemini showed lower consistency in balance and stabilization parameters.</p><p><strong>Conclusions: </strong>ChatGPT-4o and Gemini Advanced demonstrate promising potential in generating personalized rehabilitation programs for knee osteoarthritis. While their outputs generally align with expert recommendations, notable gaps remain in clinical reasoning and the provision of detailed exercise parameters. These findings underscore the importance of ongoing model refinement and the necessity of expert supervision for safe and effective clinical integration.</p>","PeriodicalId":16338,"journal":{"name":"Journal of Medical Systems","volume":"49 1","pages":"73"},"PeriodicalIF":5.7000,"publicationDate":"2025-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12134017/pdf/","citationCount":"0","resultStr":"{\"title\":\"The Role of Artificial Intelligence Large Language Models in Personalized Rehabilitation Programs for Knee Osteoarthritis: An Observational Study.\",\"authors\":\"Ömer Alperen Gürses, Anıl Özüdoğru, Figen Tuncay, Caner Kararti\",\"doi\":\"10.1007/s10916-025-02207-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Large language models (LLMs) can contribute to treatment options and outcomes by assisting physiotherapists for conditions like osteoarthritis.</p><p><strong>Aims: </strong>The objective of this early-stage cross-sectional study is to assess the alignment of large language models with physiotherapists in designing physiotherapy and rehabilitation programs for knee osteoarthritis.</p><p><strong>Methods: </strong>Forty patients diagnosed with knee osteoarthritis were assessed using standardized clinical criteria. For each patient, individualized rehabilitation programs were created by three physiotherapists and by ChatGPT-4o and Gemini Advanced using structured prompts. The presence or absence of 50 clinically relevant rehabilitation parameters was recorded for each program. Chi-square tests were used to evaluate agreement rates between the LLMs and the physiotherapist-generated Consensus programs.</p><p><strong>Results: </strong>ChatGPT-4o achieved a 74% agreement rate with the physiotherapists' Consensus programs, while Gemini Advanced achieved 70%. Although both models showed high compatibility with general rehabilitation components, they demonstrated notable limitations in exercise specificity, including frequency, sets, and progression criteria. ChatGPT-4o performed as well as or better than Gemini in most phases, particularly in Phase 3, while Gemini showed lower consistency in balance and stabilization parameters.</p><p><strong>Conclusions: </strong>ChatGPT-4o and Gemini Advanced demonstrate promising potential in generating personalized rehabilitation programs for knee osteoarthritis. While their outputs generally align with expert recommendations, notable gaps remain in clinical reasoning and the provision of detailed exercise parameters. These findings underscore the importance of ongoing model refinement and the necessity of expert supervision for safe and effective clinical integration.</p>\",\"PeriodicalId\":16338,\"journal\":{\"name\":\"Journal of Medical Systems\",\"volume\":\"49 1\",\"pages\":\"73\"},\"PeriodicalIF\":5.7000,\"publicationDate\":\"2025-06-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12134017/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Medical Systems\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1007/s10916-025-02207-x\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"HEALTH CARE SCIENCES & SERVICES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Medical Systems","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s10916-025-02207-x","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
The Role of Artificial Intelligence Large Language Models in Personalized Rehabilitation Programs for Knee Osteoarthritis: An Observational Study.
Background: Large language models (LLMs) can contribute to treatment options and outcomes by assisting physiotherapists for conditions like osteoarthritis.
Aims: The objective of this early-stage cross-sectional study is to assess the alignment of large language models with physiotherapists in designing physiotherapy and rehabilitation programs for knee osteoarthritis.
Methods: Forty patients diagnosed with knee osteoarthritis were assessed using standardized clinical criteria. For each patient, individualized rehabilitation programs were created by three physiotherapists and by ChatGPT-4o and Gemini Advanced using structured prompts. The presence or absence of 50 clinically relevant rehabilitation parameters was recorded for each program. Chi-square tests were used to evaluate agreement rates between the LLMs and the physiotherapist-generated Consensus programs.
Results: ChatGPT-4o achieved a 74% agreement rate with the physiotherapists' Consensus programs, while Gemini Advanced achieved 70%. Although both models showed high compatibility with general rehabilitation components, they demonstrated notable limitations in exercise specificity, including frequency, sets, and progression criteria. ChatGPT-4o performed as well as or better than Gemini in most phases, particularly in Phase 3, while Gemini showed lower consistency in balance and stabilization parameters.
Conclusions: ChatGPT-4o and Gemini Advanced demonstrate promising potential in generating personalized rehabilitation programs for knee osteoarthritis. While their outputs generally align with expert recommendations, notable gaps remain in clinical reasoning and the provision of detailed exercise parameters. These findings underscore the importance of ongoing model refinement and the necessity of expert supervision for safe and effective clinical integration.
期刊介绍:
Journal of Medical Systems provides a forum for the presentation and discussion of the increasingly extensive applications of new systems techniques and methods in hospital clinic and physician''s office administration; pathology radiology and pharmaceutical delivery systems; medical records storage and retrieval; and ancillary patient-support systems. The journal publishes informative articles essays and studies across the entire scale of medical systems from large hospital programs to novel small-scale medical services. Education is an integral part of this amalgamation of sciences and selected articles are published in this area. Since existing medical systems are constantly being modified to fit particular circumstances and to solve specific problems the journal includes a special section devoted to status reports on current installations.