Methodological evaluation of systematic reviews based on the use of artificial intelligence systems in chest radiography

Radiologia Pub Date : 2024-07-01 DOI:10.1016/j.rxeng.2023.01.015

J. Vidal-Mondéjar , L. Tejedor-Romero , F. Catalá-López

{"title":"Methodological evaluation of systematic reviews based on the use of artificial intelligence systems in chest radiography","authors":"J. Vidal-Mondéjar , L. Tejedor-Romero , F. Catalá-López","doi":"10.1016/j.rxeng.2023.01.015","DOIUrl":null,"url":null,"abstract":"<div><h3>Introduction</h3><p>In recent years, systems that use artificial intelligence (AI) in medical imaging have been developed, such as the interpretation of chest X-ray to rule out pathology. This has produced an increase in systematic reviews (SR) published on this topic. This article aims to evaluate the methodological quality of SRs that use AI for the diagnosis of thoracic pathology by simple chest X-ray.</p></div><div><h3>Material and methods</h3><p>SRs evaluating the use of AI systems for the automatic reading of chest X-ray were selected. Searches were conducted (from inception to May 2022): PubMed, EMBASE, and the Cochrane Database of Systematic Reviews. Two investigators selected the reviews. From each SR, general, methodological and transparency characteristics were extracted. The PRISMA statement for diagnostic tests (PRISMA-DTA) and AMSTAR-2 were used. A narrative synthesis of the evidence was performed. Protocol registry: Open Science Framework: <span><span>https://osf.io/4b6u2/</span><svg><path></path></svg></span>.</p></div><div><h3>Results</h3><p>After applying the inclusion and exclusion criteria, 7 SRs were selected (mean of 36 included studies per review). All the included SRs evaluated “deep learning” systems in which chest X-ray was used for the diagnosis of infectious diseases. Only 2 (29%) SRs indicated the existence of a review protocol. None of the SRs specified the design of the included studies or provided a list of excluded studies with their justification. Six (86%) SRs mentioned the use of PRISMA or one of its extensions. The risk of bias assessment was performed in 4 (57%) SRs. One (14%) SR included studies with some validation of AI techniques. Five (71%) SRs presented results in favour of the diagnostic capacity of the intervention. All SRs were rated \"critically low\" following AMSTAR-2 criteria.</p></div><div><h3>Conclusions</h3><p>The methodological quality of SRs that use AI systems in chest radiography can be improved. The lack of compliance in some items of the tools used means that the SRs published in this field must be interpreted with caution.</p></div>","PeriodicalId":94185,"journal":{"name":"Radiologia","volume":"66 4","pages":"Pages 326-339"},"PeriodicalIF":0.0000,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Radiologia","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2173510724000934","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Introduction

In recent years, systems that use artificial intelligence (AI) in medical imaging have been developed, such as the interpretation of chest X-ray to rule out pathology. This has produced an increase in systematic reviews (SR) published on this topic. This article aims to evaluate the methodological quality of SRs that use AI for the diagnosis of thoracic pathology by simple chest X-ray.

Material and methods

SRs evaluating the use of AI systems for the automatic reading of chest X-ray were selected. Searches were conducted (from inception to May 2022): PubMed, EMBASE, and the Cochrane Database of Systematic Reviews. Two investigators selected the reviews. From each SR, general, methodological and transparency characteristics were extracted. The PRISMA statement for diagnostic tests (PRISMA-DTA) and AMSTAR-2 were used. A narrative synthesis of the evidence was performed. Protocol registry: Open Science Framework: https://osf.io/4b6u2/.

Results

After applying the inclusion and exclusion criteria, 7 SRs were selected (mean of 36 included studies per review). All the included SRs evaluated “deep learning” systems in which chest X-ray was used for the diagnosis of infectious diseases. Only 2 (29%) SRs indicated the existence of a review protocol. None of the SRs specified the design of the included studies or provided a list of excluded studies with their justification. Six (86%) SRs mentioned the use of PRISMA or one of its extensions. The risk of bias assessment was performed in 4 (57%) SRs. One (14%) SR included studies with some validation of AI techniques. Five (71%) SRs presented results in favour of the diagnostic capacity of the intervention. All SRs were rated "critically low" following AMSTAR-2 criteria.

Conclusions

The methodological quality of SRs that use AI systems in chest radiography can be improved. The lack of compliance in some items of the tools used means that the SRs published in this field must be interpreted with caution.

查看原文本刊更多论文

基于人工智能系统在胸部放射摄影中的应用对系统综述进行方法学评估。

导言近年来，在医学影像中使用人工智能（AI）的系统不断发展，例如解读胸部 X 光片以排除病变。因此，有关这一主题的系统综述（SR）也越来越多。本文旨在评估使用人工智能通过简单胸部X光诊断胸部病变的系统综述的方法学质量：材料和方法：选取了评估使用人工智能系统自动读取胸部 X 光片的研究报告。进行了检索（从开始到 2022 年 5 月）：PubMed、EMBASE 和 Cochrane 系统综述数据库。两名研究者对综述进行了筛选。从每篇综述中提取一般特征、方法特征和透明度特征。采用了诊断测试的 PRISMA 声明（PRISMA-DTA）和 AMSTAR-2。对证据进行了叙述性综合。协议注册：开放科学框架：https://osf.io/4b6u2/.Results：在应用纳入和排除标准后，共筛选出 7 篇 SR（每篇综述平均纳入 36 项研究）。所有纳入的研究报告都对 "深度学习 "系统进行了评估，其中胸部 X 光片被用于诊断传染性疾病。只有 2 篇（29%）SR 表明有综述协议。没有一份员工代表说明了所纳入研究的设计，也没有提供排除研究的清单及其理由。有 6 份（86%）员工代表提到使用了 PRISMA 或其扩展版之一。有 4 份（57%）SR 进行了偏倚风险评估。一份（14%）标准报告纳入了人工智能技术的一些验证研究。有 5 份（71%）标准研究报告的结果支持干预措施的诊断能力。根据 AMSTAR-2 标准，所有 SR 均被评为 "极低"：结论：在胸部放射摄影中使用人工智能系统的 SR 的方法学质量有待提高。所使用工具的某些项目缺乏合规性，这意味着必须谨慎解读该领域发表的研究报告。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Radiologia

自引率

0.00%

发文量