Crowdsourcing for Medical Image Classification
A. G. S. D. Herrera, A. Foncubierta-Rodríguez, Dimitrios Markonis, Roger Schaer, H. Müller
Swiss Medical Informatics, 27 September 2014. DOI: 10.4414/SMI.30.00319
Citations: 26
Abstract
To help manage the large number of biomedical images produced, image information retrieval tools have been developed to provide access to the right information at the right moment. To provide a test bed for image retrieval evaluation, the ImageCLEFmed benchmark proposes a biomedical classification task that focuses on automatically determining the image modality of figures from biomedical journal articles. In the training data for this machine learning task, some classes contain many more images than others, so a few classes are poorly represented, which is a challenge for automatic image classification. To address this problem, an automatic training set expansion was first proposed. To improve the accuracy of this automatic expansion, the training set is then verified manually using the crowdsourcing platform Crowdflower, which allows tasks to be completed either by paid external workers or by personal contacts free of charge. Crowdsourcing requires strict quality control or the use of trusted workers, but it can quickly provide access to a large number of judges and thus improve many machine learning tasks. Results show that the manual annotation of a large number of biomedical images carried out in this project can help with image classification.
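The abstract notes that crowdsourcing needs strict quality control or trusted judges. As a rough illustration only (the paper's actual Crowdflower configuration is not described at this level of detail, and all names below are hypothetical), the following Python sketch shows one common quality-control pattern: filter workers by their accuracy on gold (known-answer) test questions, then aggregate the remaining judgments per image by majority vote.

```python
# Hedged sketch of crowdsourced label aggregation with gold-question
# quality control. Judgment, gold_answers, and the thresholds are
# illustrative assumptions, not the paper's actual Crowdflower setup.
from collections import Counter
from dataclasses import dataclass

@dataclass
class Judgment:
    image_id: str
    worker_id: str
    label: str  # proposed image modality, e.g. "X-ray" or "MRI"

def trusted_workers(judgments, gold_answers, min_accuracy=0.7):
    """Keep only workers who answer enough gold (known-answer)
    test questions correctly."""
    correct, total = Counter(), Counter()
    for j in judgments:
        if j.image_id in gold_answers:
            total[j.worker_id] += 1
            correct[j.worker_id] += j.label == gold_answers[j.image_id]
    return {w for w in total if correct[w] / total[w] >= min_accuracy}

def majority_labels(judgments, trusted):
    """Aggregate judgments from trusted workers by per-image majority vote."""
    votes = {}
    for j in judgments:
        if j.worker_id in trusted:
            votes.setdefault(j.image_id, Counter())[j.label] += 1
    return {img: counts.most_common(1)[0][0] for img, counts in votes.items()}
```

Under these assumptions, the verified labels returned by `majority_labels` would then be merged into the expanded training set before retraining the modality classifier.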