Sunghyun Park, Gelareh Mohammadi, Ron Artstein, Louis-Philippe Morency
{"title":"众包微级多媒体注释:评价与界面的挑战","authors":"Sunghyun Park, Gelareh Mohammadi, Ron Artstein, Louis-Philippe Morency","doi":"10.1145/2390803.2390816","DOIUrl":null,"url":null,"abstract":"This paper presents a new evaluation procedure and tool for crowdsourcing micro-level multimedia annotations and shows that such annotations can achieve a quality comparable to that of expert annotations. We propose a new evaluation procedure, called MM-Eval (Micro-level Multimedia Evaluation), which compares fine time-aligned annotations using Krippendorff's alpha metric and introduce two new metrics to evaluate the types of disagreement between coders. We also introduce OCTAB (Online Crowdsourcing Tool for Annotations of Behaviors), a web-based annotation tool that allows precise and convenient multimedia behavior annotations, directly from Amazon Mechanical Turk interface. With an experiment using the above tool and evaluation procedure, we show that a majority vote among annotations from 3 crowdsource workers leads to a quality comparable to that of local expert annotations.","PeriodicalId":429491,"journal":{"name":"CrowdMM '12","volume":"75 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"33","resultStr":"{\"title\":\"Crowdsourcing micro-level multimedia annotations: the challenges of evaluation and interface\",\"authors\":\"Sunghyun Park, Gelareh Mohammadi, Ron Artstein, Louis-Philippe Morency\",\"doi\":\"10.1145/2390803.2390816\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a new evaluation procedure and tool for crowdsourcing micro-level multimedia annotations and shows that such annotations can achieve a quality comparable to that of expert annotations. We propose a new evaluation procedure, called MM-Eval (Micro-level Multimedia Evaluation), which compares fine time-aligned annotations using Krippendorff's alpha metric and introduce two new metrics to evaluate the types of disagreement between coders. We also introduce OCTAB (Online Crowdsourcing Tool for Annotations of Behaviors), a web-based annotation tool that allows precise and convenient multimedia behavior annotations, directly from Amazon Mechanical Turk interface. With an experiment using the above tool and evaluation procedure, we show that a majority vote among annotations from 3 crowdsource workers leads to a quality comparable to that of local expert annotations.\",\"PeriodicalId\":429491,\"journal\":{\"name\":\"CrowdMM '12\",\"volume\":\"75 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-10-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"33\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"CrowdMM '12\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2390803.2390816\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"CrowdMM '12","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2390803.2390816","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 33
摘要
本文提出了一种新的众包微级多媒体注释评价方法和评价工具,并表明该方法可以达到与专家注释相当的质量。我们提出了一种新的评价方法,称为MM-Eval (Micro-level Multimedia evaluation),它使用Krippendorff的alpha度量来比较精细的时间排列注释,并引入了两个新的度量来评价编码员之间的分歧类型。我们还介绍了OCTAB (Online Crowdsourcing Tool for Annotations of Behaviors),这是一个基于web的注释工具,可以直接从Amazon Mechanical Turk界面进行精确和方便的多媒体行为注释。通过使用上述工具和评估程序的实验,我们表明,来自3名众包工作者的多数投票导致的注释质量与当地专家的注释相当。
Crowdsourcing micro-level multimedia annotations: the challenges of evaluation and interface
This paper presents a new evaluation procedure and tool for crowdsourcing micro-level multimedia annotations and shows that such annotations can achieve a quality comparable to that of expert annotations. We propose a new evaluation procedure, called MM-Eval (Micro-level Multimedia Evaluation), which compares fine time-aligned annotations using Krippendorff's alpha metric and introduce two new metrics to evaluate the types of disagreement between coders. We also introduce OCTAB (Online Crowdsourcing Tool for Annotations of Behaviors), a web-based annotation tool that allows precise and convenient multimedia behavior annotations, directly from Amazon Mechanical Turk interface. With an experiment using the above tool and evaluation procedure, we show that a majority vote among annotations from 3 crowdsource workers leads to a quality comparable to that of local expert annotations.