Using artificial intelligence to develop a measure of orthopaedic treatment success from clinical notes.

IF 3.2 Q1 HEALTH CARE SCIENCES & SERVICES

Frontiers in digital health Pub Date : 2025-04-24 eCollection Date: 2025-01-01 DOI:10.3389/fdgth.2025.1523953

Sarah B Floyd, Ahmed G Almeldien, D Hudson Smith, Benjamin Judkins, Claire E Krohn, Zachary Cole Reynolds, Kyle Jeray, Jihad S Obeid

{"title":"Using artificial intelligence to develop a measure of orthopaedic treatment success from clinical notes.","authors":"Sarah B Floyd, Ahmed G Almeldien, D Hudson Smith, Benjamin Judkins, Claire E Krohn, Zachary Cole Reynolds, Kyle Jeray, Jihad S Obeid","doi":"10.3389/fdgth.2025.1523953","DOIUrl":null,"url":null,"abstract":"Introduction: A readily available outcome measure that reflects the success of a patient's treatment is needed to demonstrate the value of orthopaedic interventions. Patient-reported outcome measures (PROMs) are survey-based instruments that collect joint-specific and general health perceptions on symptoms, functioning, and health-related quality of life. PROMs are considered the gold standard outcome measure in orthopaedic medicine, but their use is limited in real-world practice due to challenges with technology integration, the pace of clinic workflows, and patient compliance. Clinical notes generated during each encounter patients have with their physician contain rich information on current disease symptoms, rehabilitation progress, and unexpected complications. Artificial intelligence (AI) methods can be used to identify phrases of treatment success or failure captured in clinical notes and discern an indicator of treatment success for orthopaedic patients.Methods: This was a cross-sectional analysis of clinical notes from a sample of patients with an acute shoulder injury. The study included adult patients presenting to a Level-1 Trauma Center and regional health system for an acute Proximal Humerus Fracture (PHF) between January 1, 2019 and December 31, 2021. We used the progress note from the office visit for PHF-related care (ICD10: S42.2XXX) or shoulder pain (ICD10: M45.2XXX) closest to 1-year after the injury date. Clinical notes were reviewed by an orthopaedic resident and labeled as treatment success or failure. A structured comparative analysis of classifiers including both machine and deep learning algorithms was performed.Results: The final sample included 868 clinical notes from patients treated by 123 physicians across 35 departments within one regional health system. The study sample was stratified into 465 notes labeled as treatment success and 403 labeled as treatment failure. The Bio-ClinicalBERT model had the highest performance of 87% accuracy (AUC = 0.87 ± 0.04) in correctly distinguishing between treatment success and failure notes.Discussion: Our results suggest that text classifiers applied to clinical notes are capable of differentiating patients with successful treatment outcomes with high levels of accuracy. This finding is encouraging, signaling that routinely collected clinical note content may serve as a data source to develop an outcome measure for orthopaedic patients.","PeriodicalId":73078,"journal":{"name":"Frontiers in digital health","volume":"7 ","pages":"1523953"},"PeriodicalIF":3.2000,"publicationDate":"2025-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12058693/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in digital health","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fdgth.2025.1523953","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}

引用次数: 0

Abstract

Introduction: A readily available outcome measure that reflects the success of a patient's treatment is needed to demonstrate the value of orthopaedic interventions. Patient-reported outcome measures (PROMs) are survey-based instruments that collect joint-specific and general health perceptions on symptoms, functioning, and health-related quality of life. PROMs are considered the gold standard outcome measure in orthopaedic medicine, but their use is limited in real-world practice due to challenges with technology integration, the pace of clinic workflows, and patient compliance. Clinical notes generated during each encounter patients have with their physician contain rich information on current disease symptoms, rehabilitation progress, and unexpected complications. Artificial intelligence (AI) methods can be used to identify phrases of treatment success or failure captured in clinical notes and discern an indicator of treatment success for orthopaedic patients.

Methods: This was a cross-sectional analysis of clinical notes from a sample of patients with an acute shoulder injury. The study included adult patients presenting to a Level-1 Trauma Center and regional health system for an acute Proximal Humerus Fracture (PHF) between January 1, 2019 and December 31, 2021. We used the progress note from the office visit for PHF-related care (ICD10: S42.2XXX) or shoulder pain (ICD10: M45.2XXX) closest to 1-year after the injury date. Clinical notes were reviewed by an orthopaedic resident and labeled as treatment success or failure. A structured comparative analysis of classifiers including both machine and deep learning algorithms was performed.

Results: The final sample included 868 clinical notes from patients treated by 123 physicians across 35 departments within one regional health system. The study sample was stratified into 465 notes labeled as treatment success and 403 labeled as treatment failure. The Bio-ClinicalBERT model had the highest performance of 87% accuracy (AUC = 0.87 ± 0.04) in correctly distinguishing between treatment success and failure notes.

Discussion: Our results suggest that text classifiers applied to clinical notes are capable of differentiating patients with successful treatment outcomes with high levels of accuracy. This finding is encouraging, signaling that routinely collected clinical note content may serve as a data source to develop an outcome measure for orthopaedic patients.

查看原文本刊更多论文

利用人工智能从临床记录中开发一种衡量骨科治疗成功的方法。

简介：需要一个现成的反映患者治疗成功的结果测量来证明矫形干预的价值。患者报告的结果测量（PROMs）是基于调查的工具，收集关节特异性和一般健康感知的症状、功能和健康相关的生活质量。PROMs被认为是骨科医学中衡量结果的黄金标准，但由于技术集成、临床工作流程的速度和患者依从性方面的挑战，其在实际实践中的应用受到限制。临床记录产生在每次遇到病人与他们的医生包含丰富的信息，目前的疾病症状，康复的进展，和意想不到的并发症。人工智能（AI）方法可用于识别临床记录中捕获的治疗成功或失败的短语，并识别骨科患者治疗成功的指标。方法：这是对急性肩伤患者样本的临床记录进行横断面分析。该研究包括2019年1月1日至2021年12月31日期间在一级创伤中心和区域卫生系统就诊的急性肱骨近端骨折（PHF）成年患者。我们使用了受伤日期后近1年的phf相关护理（ICD10: S42.2XXX）或肩痛（ICD10: M45.2XXX）的办公室就诊进度记录。临床记录由骨科住院医师审查，并标记为治疗成功或失败。对包括机器和深度学习算法的分类器进行了结构化的比较分析。结果：最终样本包括来自一个区域卫生系统内35个部门123名医生治疗的患者的868份临床记录。研究样本被分为465个标记为治疗成功的笔记和403个标记为治疗失败的笔记。在正确区分治疗成功和失败记录方面，Bio-ClinicalBERT模型的准确率最高，达到87% （AUC = 0.87±0.04）。讨论：我们的研究结果表明，应用于临床记录的文本分类器能够以高水平的准确性区分成功治疗结果的患者。这一发现是令人鼓舞的，表明常规收集的临床记录内容可以作为数据来源来开发骨科患者的结果测量。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊