A systematic review of natural language processing applications in Trauma & Orthopaedics.

IF 2.8 Q1 ORTHOPEDICS

Bone & Joint Open Pub Date : 2025-03-05 DOI:10.1302/2633-1462.63.BJO-2024-0081.R1

Luke Farrow, Arslan Raja, Mingjun Zhong, Lesley Anderson

{"title":"A systematic review of natural language processing applications in Trauma & Orthopaedics.","authors":"Luke Farrow, Arslan Raja, Mingjun Zhong, Lesley Anderson","doi":"10.1302/2633-1462.63.BJO-2024-0081.R1","DOIUrl":null,"url":null,"abstract":"Aims: Prevalence of artificial intelligence (AI) algorithms within the Trauma & Orthopaedics (T&O) literature has greatly increased over the last ten years. One increasingly explored aspect of AI is the automated interpretation of free-text data often prevalent in electronic medical records (known as natural language processing (NLP)). We set out to review the current evidence for applications of NLP methodology in T&O, including assessment of study design and reporting.Methods: MEDLINE, Allied and Complementary Medicine (AMED), Excerpta Medica Database (EMBASE), and Cochrane Central Register of Controlled Trials (CENTRAL) were screened for studies pertaining to NLP in T&O from database inception to 31 December 2023. An additional grey literature search was performed. NLP quality assessment followed the criteria outlined by Farrow et al in 2021 with two independent reviewers (classification as absent, incomplete, or complete). Reporting was performed according to the Synthesis-Without Meta-Analysis (SWiM) guidelines. The review protocol was registered on the Prospective Register of Systematic Reviews (PROSPERO; registration no. CRD42022291714).Results: The final review included 31 articles (published between 2012 and 2021). The most common subspeciality areas included trauma, arthroplasty, and spine; 13% (4/31) related to online reviews/social media, 42% (13/31) to clinical notes/operation notes, 42% (13/31) to radiology reports, and 3% (1/31) to systematic review. According to the reporting criteria, 16% (5/31) were considered good quality, 74% (23/31) average quality, and 6% (2/31) poor quality. The most commonly absent reporting criteria were evaluation of missing data (26/31), sample size calculation (31/31), and external validation of the study results (29/31 papers). Code and data availability were also poorly documented in most studies.Conclusion: Application of NLP is becoming increasingly common in T&O; however, published article quality is mixed, with few high-quality studies. There are key consistent deficiencies in published work relating to NLP which ultimately influence the potential for clinical application. Open science is an important part of research transparency that should be encouraged in NLP algorithm development and reporting.","PeriodicalId":34103,"journal":{"name":"Bone & Joint Open","volume":"6 3","pages":"264-274"},"PeriodicalIF":2.8000,"publicationDate":"2025-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11879473/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bone & Joint Open","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1302/2633-1462.63.BJO-2024-0081.R1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ORTHOPEDICS","Score":null,"Total":0}

引用次数: 0

Abstract

Aims: Prevalence of artificial intelligence (AI) algorithms within the Trauma & Orthopaedics (T&O) literature has greatly increased over the last ten years. One increasingly explored aspect of AI is the automated interpretation of free-text data often prevalent in electronic medical records (known as natural language processing (NLP)). We set out to review the current evidence for applications of NLP methodology in T&O, including assessment of study design and reporting.

Methods: MEDLINE, Allied and Complementary Medicine (AMED), Excerpta Medica Database (EMBASE), and Cochrane Central Register of Controlled Trials (CENTRAL) were screened for studies pertaining to NLP in T&O from database inception to 31 December 2023. An additional grey literature search was performed. NLP quality assessment followed the criteria outlined by Farrow et al in 2021 with two independent reviewers (classification as absent, incomplete, or complete). Reporting was performed according to the Synthesis-Without Meta-Analysis (SWiM) guidelines. The review protocol was registered on the Prospective Register of Systematic Reviews (PROSPERO; registration no. CRD42022291714).

Results: The final review included 31 articles (published between 2012 and 2021). The most common subspeciality areas included trauma, arthroplasty, and spine; 13% (4/31) related to online reviews/social media, 42% (13/31) to clinical notes/operation notes, 42% (13/31) to radiology reports, and 3% (1/31) to systematic review. According to the reporting criteria, 16% (5/31) were considered good quality, 74% (23/31) average quality, and 6% (2/31) poor quality. The most commonly absent reporting criteria were evaluation of missing data (26/31), sample size calculation (31/31), and external validation of the study results (29/31 papers). Code and data availability were also poorly documented in most studies.

Conclusion: Application of NLP is becoming increasingly common in T&O; however, published article quality is mixed, with few high-quality studies. There are key consistent deficiencies in published work relating to NLP which ultimately influence the potential for clinical application. Open science is an important part of research transparency that should be encouraged in NLP algorithm development and reporting.

查看原文本刊更多论文

自然语言处理在创伤与骨科中的应用综述。

目的：人工智能（AI）算法在创伤与骨科（T&O）文献中的流行在过去十年中大大增加。人工智能的一个日益被探索的方面是对电子病历中普遍存在的自由文本数据的自动解释（称为自然语言处理（NLP））。我们着手审查NLP方法在T&O中应用的现有证据，包括研究设计和报告的评估。方法：筛选MEDLINE、联合和补充医学（AMED）、医学摘录数据库（EMBASE）和Cochrane中央对照试验登记册（Central）从数据库建立到2023年12月31日与T&O中NLP相关的研究。进行了额外的灰色文献检索。NLP质量评估遵循Farrow等人在2021年由两名独立审稿人概述的标准（分类为缺席、不完整或完整）。根据综合-无荟萃分析（SWiM）指南进行报告。该审查方案已在前瞻性系统审查登记册(PROSPERO；没有注册。CRD42022291714)。最终纳入31篇文章（发表时间为2012 - 2021年）。最常见的亚专科包括创伤、关节成形术和脊柱；13%（4/31）与在线评论/社交媒体有关，42%（13/31）与临床记录/手术记录有关，42%（13/31）与放射学报告有关，3%（1/31）与系统评价有关。根据报告标准，16%（5/31）被认为质量好，74%（23/31）被认为质量一般，6%（2/31）被认为质量差。最常见的缺失报告标准是缺失数据的评估（26/31）、样本量计算（31/31）和研究结果的外部验证（29/31）。在大多数研究中，代码和数据的可用性也没有得到很好的记录。结论：NLP在T&O中的应用越来越普遍；然而，发表的文章质量参差不齐，高质量的研究很少。在与NLP相关的已发表的工作中存在关键的一致缺陷，这些缺陷最终影响了临床应用的潜力。开放科学是研究透明度的重要组成部分，在NLP算法开发和报告中应该鼓励开放科学。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊