Active labeling: Application to wireless endoscopy analysis

2012 International Conference on High Performance Computing & Simulation (HPCS) Pub Date : 2012-07-02 DOI:10.1109/HPCSIM.2012.6266908

P. Radeva, M. Drozdzal, S. Seguí, L. Igual, C. Malagelada, F. Azpiroz, Jordi Vitrià

{"title":"Active labeling: Application to wireless endoscopy analysis","authors":"P. Radeva, M. Drozdzal, S. Seguí, L. Igual, C. Malagelada, F. Azpiroz, Jordi Vitrià","doi":"10.1109/HPCSIM.2012.6266908","DOIUrl":null,"url":null,"abstract":"Today, robust learners trained in a real supervised machine learning application should count with a rich collection of positive and negative examples. Although in many applications, it is not difficult to obtain huge amount of data, labeling those data can be a very expensive process, especially when dealing with data of high variability and complexity. A good example of such cases are data from medical imaging applications where annotating anomalies like tumors, polyps, atherosclerotic plaque or informative frames in wireless endoscopy need highly trained experts. Building a representative set of training data from medical videos (e.g. Wireless Capsule Endoscopy) means that thousands of frames to be labeled by an expert. It is quite normal that data in new videos come different and thus are not represented by the training set. In this paper, we review the main approaches on active learning and illustrate how active learning can help to reduce expert effort in constructing the training sets. We show that applying active learning criteria, the number of human interventions can be significantly reduced. The proposed system allows the annotation of informative/non-informative frames of Wireless Capsule Endoscopy video containing more than 30000 frames each one with less than 100 expert ”clicks”.","PeriodicalId":428764,"journal":{"name":"2012 International Conference on High Performance Computing & Simulation (HPCS)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 International Conference on High Performance Computing & Simulation (HPCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HPCSIM.2012.6266908","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

Today, robust learners trained in a real supervised machine learning application should count with a rich collection of positive and negative examples. Although in many applications, it is not difficult to obtain huge amount of data, labeling those data can be a very expensive process, especially when dealing with data of high variability and complexity. A good example of such cases are data from medical imaging applications where annotating anomalies like tumors, polyps, atherosclerotic plaque or informative frames in wireless endoscopy need highly trained experts. Building a representative set of training data from medical videos (e.g. Wireless Capsule Endoscopy) means that thousands of frames to be labeled by an expert. It is quite normal that data in new videos come different and thus are not represented by the training set. In this paper, we review the main approaches on active learning and illustrate how active learning can help to reduce expert effort in constructing the training sets. We show that applying active learning criteria, the number of human interventions can be significantly reduced. The proposed system allows the annotation of informative/non-informative frames of Wireless Capsule Endoscopy video containing more than 30000 frames each one with less than 100 expert ”clicks”.

查看原文本刊更多论文

主动标签:应用于无线内窥镜分析

今天，在真正的监督机器学习应用程序中训练的健壮的学习者应该具有丰富的正面和负面示例集合。尽管在许多应用程序中，获取大量数据并不困难，但标记这些数据可能是一个非常昂贵的过程，特别是在处理高度可变性和复杂性的数据时。这种情况的一个很好的例子是来自医学成像应用的数据，其中在无线内窥镜中注释肿瘤、息肉、动脉粥样硬化斑块或信息框架等异常需要训练有素的专家。从医学视频(如无线胶囊内窥镜)中构建具有代表性的训练数据集意味着专家需要对数千帧进行标记。新视频中的数据不同，因此不能用训练集表示，这是很正常的。在本文中，我们回顾了主动学习的主要方法，并说明了主动学习如何帮助减少专家构建训练集的工作量。我们表明，应用主动学习标准，人工干预的数量可以显着减少。所提出的系统允许在少于100次专家“点击”的情况下，对每帧超过30000帧的无线胶囊内窥镜视频进行信息/非信息帧的注释。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2012 International Conference on High Performance Computing & Simulation (HPCS)

自引率

0.00%

发文量