{"title":"Readability of Orthopaedic Patient Educational Material: An artificial intelligence application","authors":"Miles LaNicca , Ellis Wright , Ellen Lutnick","doi":"10.1016/j.jcot.2025.102971","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>This study aims to determine the efficacy of the use of artificial intelligence (AI) in rewriting orthopaedic trauma hospital patient educational materials to a patient-appropriate reading level.</div></div><div><h3>Materials and methods</h3><div>35 orthopaedic patient educational articles were identified from three hospital networks with Level 1 Trauma Centers, categorized based on average reading level. They were run through a formatting Python code, and then a secondary code to determine readability metrics outlined in Table 1. The articles were then rewritten via four iterations of Generative Pre-Trained Transformer (GPT) AI language models. Each model was given the same prompt, outlined in Fig. 1, to rewrite the articles to a 6th grade reading level per AMA recommendations. The rewritten articles were checked for accuracy and formatted and scored to determine mean reading level. Additional analysis was run comparing 9 different AI models from 3 different companies, using the same prompt, comparing cost and percent token reduction.</div></div><div><h3>Results</h3><div>All GPT AI models lowered the mean combined grade level (Table 2). Fig. 2 compares each GPT model's output to the original articles reading grade level. The oldest model (GPT-3.5-Turbo) was the least consistent and least effective. GPT-4o-Mini and GPT-4o were the most effective and consistent regardless of original article difficulty. Table 3 outlines the cost of running all 35 articles through each GPT model. The most accurate model (GPT-4o) was only $0.61; however, there was only a 0.421 % increase in effectiveness comparing GPT-4o vs. GPT-4o-Mini, at a 175.38 % increase in cost. All GPT rewritten articles were screened for accuracy and determined to have no falsified information or medical inaccuracies. Expanded analysis across 9 AI models is demonstrated in Fig. 4. Fig. 5 compares cost and percent token reduction.</div></div><div><h3>Conclusion</h3><div>AI is a viable option for reducing the reading difficulty of patient educational materials while maintaining accuracy. Of the models included for analysis, GPT-4o-Mini appears to be the most efficient language model when considering effectiveness, cost, and maintenance of the information included in the original articles.</div></div>","PeriodicalId":53594,"journal":{"name":"Journal of Clinical Orthopaedics and Trauma","volume":"64 ","pages":"Article 102971"},"PeriodicalIF":0.0000,"publicationDate":"2025-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Clinical Orthopaedics and Trauma","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0976566225000670","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0
Abstract
Background
This study aims to determine the efficacy of the use of artificial intelligence (AI) in rewriting orthopaedic trauma hospital patient educational materials to a patient-appropriate reading level.
Materials and methods
35 orthopaedic patient educational articles were identified from three hospital networks with Level 1 Trauma Centers, categorized based on average reading level. They were run through a formatting Python code, and then a secondary code to determine readability metrics outlined in Table 1. The articles were then rewritten via four iterations of Generative Pre-Trained Transformer (GPT) AI language models. Each model was given the same prompt, outlined in Fig. 1, to rewrite the articles to a 6th grade reading level per AMA recommendations. The rewritten articles were checked for accuracy and formatted and scored to determine mean reading level. Additional analysis was run comparing 9 different AI models from 3 different companies, using the same prompt, comparing cost and percent token reduction.
Results
All GPT AI models lowered the mean combined grade level (Table 2). Fig. 2 compares each GPT model's output to the original articles reading grade level. The oldest model (GPT-3.5-Turbo) was the least consistent and least effective. GPT-4o-Mini and GPT-4o were the most effective and consistent regardless of original article difficulty. Table 3 outlines the cost of running all 35 articles through each GPT model. The most accurate model (GPT-4o) was only $0.61; however, there was only a 0.421 % increase in effectiveness comparing GPT-4o vs. GPT-4o-Mini, at a 175.38 % increase in cost. All GPT rewritten articles were screened for accuracy and determined to have no falsified information or medical inaccuracies. Expanded analysis across 9 AI models is demonstrated in Fig. 4. Fig. 5 compares cost and percent token reduction.
Conclusion
AI is a viable option for reducing the reading difficulty of patient educational materials while maintaining accuracy. Of the models included for analysis, GPT-4o-Mini appears to be the most efficient language model when considering effectiveness, cost, and maintenance of the information included in the original articles.
期刊介绍:
Journal of Clinical Orthopaedics and Trauma (JCOT) aims to provide its readers with the latest clinical and basic research, and informed opinions that shape today''s orthopedic practice, thereby providing an opportunity to practice evidence-based medicine. With contributions from leading clinicians and researchers around the world, we aim to be the premier journal providing an international perspective advancing knowledge of the musculoskeletal system. JCOT publishes content of value to both general orthopedic practitioners and specialists on all aspects of musculoskeletal research, diagnoses, and treatment. We accept following types of articles: • Original articles focusing on current clinical issues. • Review articles with learning value for professionals as well as students. • Research articles providing the latest in basic biological or engineering research on musculoskeletal diseases. • Regular columns by experts discussing issues affecting the field of orthopedics. • "Symposia" devoted to a single topic offering the general reader an overview of a field, but providing the specialist current in-depth information. • Video of any orthopedic surgery which is innovative and adds to present concepts. • Articles emphasizing or demonstrating a new clinical sign in the art of patient examination is also considered for publication. Contributions from anywhere in the world are welcome and considered on their merits.