A systematic review of active learning approaches in the selection of medical images

Procedia Computer Science Pub Date : 2025-01-01 DOI:10.1016/j.procs.2025.02.186

Maria Santos , Goreti Marreiros

{"title":"A systematic review of active learning approaches in the selection of medical images","authors":"Maria Santos , Goreti Marreiros","doi":"10.1016/j.procs.2025.02.186","DOIUrl":null,"url":null,"abstract":"<div><div>Background: Active Learning has been proven to be an effective way to maximize the model’s learning capacity, using fewer amounts of labeled data. In the field of medical imaging data, data and annotations can be scarce and very expensive to obtain, so techniques like Active Learning can be a useful solution. Methods: For this systematic review, the data sources were obtained through IEEE Explore, PubMed, and ACM Digital Library, between the period of 2018 and 2023. Only studies that belonged to the field of healthcare (using medical images as a dataset) and machine learning, written in English and that were not a book, or a survey were used. Covidence was used as a tool to synthesize the results. Results: From 336 records, 51 were included in this review. Interpretation: Most studies showed that Active Learning can have a positive impact on the construction of models, however, it is important to not consider only the informativeness/uncertainty of the sample, but also the distribution of the data, reducing the probability of selecting samples that are not representative enough of the dataset or outliers. Active Learning is usually an iterative process until a stop criterion is met, for example, the model’s performance. To evaluate an Active Learning solution, the proposed method is usually compared with random sampling, or other Active Learning queries.</div></div>","PeriodicalId":20465,"journal":{"name":"Procedia Computer Science","volume":"256 ","pages":"Pages 843-851"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Procedia Computer Science","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1877050925005472","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Active Learning has been proven to be an effective way to maximize the model’s learning capacity, using fewer amounts of labeled data. In the field of medical imaging data, data and annotations can be scarce and very expensive to obtain, so techniques like Active Learning can be a useful solution. Methods: For this systematic review, the data sources were obtained through IEEE Explore, PubMed, and ACM Digital Library, between the period of 2018 and 2023. Only studies that belonged to the field of healthcare (using medical images as a dataset) and machine learning, written in English and that were not a book, or a survey were used. Covidence was used as a tool to synthesize the results. Results: From 336 records, 51 were included in this review. Interpretation: Most studies showed that Active Learning can have a positive impact on the construction of models, however, it is important to not consider only the informativeness/uncertainty of the sample, but also the distribution of the data, reducing the probability of selecting samples that are not representative enough of the dataset or outliers. Active Learning is usually an iterative process until a stop criterion is met, for example, the model’s performance. To evaluate an Active Learning solution, the proposed method is usually compared with random sampling, or other Active Learning queries.

查看原文本刊更多论文

主动学习方法在医学图像选择中的系统回顾

背景：主动学习已经被证明是一种有效的方法来最大化模型的学习能力，使用较少的标记数据。在医学成像数据领域，数据和注释可能是稀缺的，并且获取起来非常昂贵，因此像主动学习这样的技术可能是一个有用的解决方案。方法：本系统综述的数据来源为2018年至2023年，通过IEEE Explore、PubMed和ACM数字图书馆获得。只使用了属于医疗保健领域（使用医学图像作为数据集）和机器学习的研究，这些研究是用英语编写的，不是一本书，也不是一项调查。以covid为工具综合结果。结果：从336例记录中，51例纳入本综述。解释：大多数研究表明，主动学习可以对模型的构建产生积极的影响，然而，重要的是不仅要考虑样本的信息性/不确定性，还要考虑数据的分布，减少选择样本的概率，这些样本不足以代表数据集或异常值。主动学习通常是一个迭代过程，直到满足停止标准，例如，模型的性能。为了评估主动学习的解决方案，通常将所提出的方法与随机抽样或其他主动学习查询进行比较。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Procedia Computer Science

CiteScore

4.50

自引率

0.00%

发文量