How clumpy is my image? Evaluating crowdsourced annotation tasks
Hugo Hutt, R. Everson, Murray Grant, John Love, George R. Littlejohn
2013 13th UK Workshop on Computational Intelligence (UKCI), 31 October 2013. DOI: 10.1109/UKCI.2013.6651298
The use of citizen science to obtain annotations from multiple annotators has been shown to be an effective method for annotating datasets for which computational methods alone are not feasible. The way in which the annotations are obtained is an important consideration that affects the quality of the resulting consensus estimates. In this paper, we examine three separate approaches to obtaining scores for instances rather than merely classifications. To obtain a consensus score, annotators were asked to make annotations in one of three paradigms: classification, scoring and ranking. A web-based citizen science experiment is described which implements the three approaches as crowdsourced annotation tasks. The tasks are evaluated with respect to accuracy and agreement among the participants, using both simulated and real-world data from the experiment. The results show a clear difference in performance between the three tasks, with the ranking task achieving the highest accuracy and agreement among the participants. We show how a simple evolutionary optimiser may be used to improve performance by reweighting the importance of individual annotators.
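The abstract does not give implementation details, but the two ideas it describes can be illustrated with a minimal sketch: forming a consensus score from ranking annotations, and reweighting annotator importance with a simple evolutionary optimiser. The sketch below is an assumption-laden illustration, not the authors' code: the weighted mean-rank aggregation, the (1+1)-style mutate-and-select loop, the Spearman-correlation fitness against a simulated ground truth, and all function names (weighted_consensus_scores, evolve_weights) are hypothetical choices made here for clarity.

```python
# Illustrative sketch only (not the paper's implementation): weighted rank
# aggregation plus a simple (1+1)-style evolutionary reweighting of annotators.
import numpy as np


def weighted_consensus_scores(rankings, weights):
    """Combine per-annotator rankings into consensus scores.

    rankings: (n_annotators, n_instances) array, rankings[a, i] is the rank
              annotator a gave instance i (0 = least clumpy).
    weights:  (n_annotators,) non-negative importance weights.
    Returns a (n_instances,) weighted mean rank per instance.
    """
    w = np.clip(weights, 0.0, None)
    w = w / (w.sum() + 1e-12)      # normalise so the weights sum to one
    return rankings.T @ w          # weighted average rank per instance


def spearman(a, b):
    """Spearman rank correlation between two score vectors (no tie handling)."""
    ra = np.argsort(np.argsort(a)).astype(float)
    rb = np.argsort(np.argsort(b)).astype(float)
    ra -= ra.mean()
    rb -= rb.mean()
    return float((ra @ rb) / np.sqrt((ra @ ra) * (rb @ rb)))


def evolve_weights(rankings, ground_truth, generations=500, sigma=0.1, rng=None):
    """(1+1)-style evolutionary search for annotator weights.

    Mutates the weight vector with Gaussian noise and keeps the mutant only if
    it does not reduce agreement (Spearman correlation) between the weighted
    consensus and a known ground truth, e.g. from simulated data.
    """
    rng = np.random.default_rng(rng)
    n_annotators = rankings.shape[0]
    weights = np.ones(n_annotators)   # start from equal importance
    best = spearman(weighted_consensus_scores(rankings, weights), ground_truth)

    for _ in range(generations):
        candidate = np.clip(weights + rng.normal(0.0, sigma, n_annotators), 0.0, None)
        fitness = spearman(weighted_consensus_scores(rankings, candidate), ground_truth)
        if fitness >= best:           # accept non-worse mutants
            weights, best = candidate, fitness
    return weights, best


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    n_annotators, n_instances = 8, 30
    true_scores = rng.random(n_instances)          # simulated 'clumpiness'

    # Simulated annotators: most rank reasonably well, the last two are near-random.
    noise = np.full(n_annotators, 0.05)
    noise[-2:] = 1.0
    observed = true_scores + rng.normal(0.0, noise[:, None], (n_annotators, n_instances))
    rankings = np.argsort(np.argsort(observed, axis=1), axis=1).astype(float)

    before = spearman(weighted_consensus_scores(rankings, np.ones(n_annotators)), true_scores)
    weights, after = evolve_weights(rankings, true_scores, rng=1)
    print(f"agreement with ground truth: equal weights {before:.3f}, evolved weights {after:.3f}")
```

On simulated data of this kind, where the ground truth is known by construction, the optimiser would be expected to down-weight the near-random annotators and so raise the agreement between the consensus and the true scores; how the real experiment defined fitness and aggregated annotations is specified in the paper itself, not in this sketch.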